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MAMMALIAN gb2 GABA„ RECEPTORS 

Field of The Invention 

The present invention relates to a new subtype of mammalian GABA Q 
receptor. This subtype has been designated gb2 GABA Q . In particular, the present 
invention relates to nucleic acids (DNA) encoding gb2 GABA a . to vectors and host 
expression systems containing these nucleic acids, to isolated and recombinantly- 
produced antigens encoded by these nucleic acids, and to antibodies produced against 
these antigens. 



Background 

GABA (Y-aminobutyric acid or 4-aminobutanoic acid) is the major inhibitory 
neurotransmitter in the mammalian brain. GABAergic inhibitory synaptic 
transmission is mediated by two distinct types of receptors, ligand-gated chloride 
channels and G-protein-coupled receptors. The muiti-subunit GABA A receptors 
constitute the ligand-gated channels that when activated allow the passage of chloride 
ions into the cell resulting in hyperpolarization and cessation of neurotransmission. 
GABA B receptors, which are abundantly and heterogeneousiy distributed throughout 
the central nervous system, are members of the G-protein-coupled receptor 
superfamily and mediate a variety of inhibitory cellular processes through pertussis 
toxin-sensitive G-proteins G n and/or G ; . The G-proteins (i.e., guanine nucleotide 
binding proteins) allow functional coupling of the GABA B receptors to Ca ik and K fc 
channels. Activation of GABA B receptors has been shown to reduce evoked 
neurotransmitter release through modulation of voltage-sensitive Ca* and/or K' 
currents, hyperpolarizc a postsynaptic cell through activation of K' currents, and 
regulate adenylyl cyclase activity (see, for example, review articles by Bowery. Annu. 
Rev. Pharmacol. Toxicol. 33. 109-147 (1993); Kerr et aL Pharmac. Ther. 67. 187-246 
(1995); Misgeid et aL Prog. Neurobiol. 46. 423-462 (1995)). GABA Q receptors were 
originally identified due to their insensitivity to the GABA A antagonist bicuculline 
and their sensitivity to the agonist baclofen (Bowery et aL Eur. J, Pharmacol. 71. 53- 
70 (1981); Hill et aL Nature 290. 149-152 (1981 )). 
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Second messenger and electrophysiological studies of pre- and postsynaptic 
GABA„ receptors suygest that there may be as many as tour receptor subtypes 
(Gemignani et al.. Mol. Pharmacol. 46. 558-562 ( 1 994); Cunningham et al.. Brain 
Res. 720, 220-224 (1996): Bonanno et al.. Trends Pharmacol. Set. 14. 259-261 
5 ( 1 993)). Presynaptic GAB A„ autoreceptors have been shown to modulate GAB A 
release, while neuroreceptors modulate the release of several neurotransmitters and 
peptides including L-glutamate. somatostatin, substance P. and cholecystokinin 
(Huston et al.. Neurosci. 68. 465-478 (1995); Pende et al.. Brain Res. 604. 325-330 
(1993); Bonanno et al.. Br. J. Pharmacol. 1 18, 1441-1446 (1996); Teoh et al., Br. J. 
) Pharmacol. 118. H 53- 1 160 (1996); Raiteri et al.. J. Pharmacol. Exp. Therapeut. 278. 
747-751 (1996)). While the study of GABA A receptor structure and function has been 
advanced greatly by knowledge of their primary sequences, studies of the G AB A B 
receptor have only recently begun. Kaupmann et al. (Nature 386. 239-246 (1997)) 
have described a sequence for a GABA B receptor which was isolated by expression 
cloning techniques. The identified receptors, rgbla and rglb. are members of the G 
protein-coupled seven transmembrane domain receptor family and are splice variants 
of the same receptor. These receptors exhibit the expected sensitivity and rank order 
of potency of GABA„ receptor compounds that has been described in the literature for 
GABA 3 receptors in tissue preparations. 

Summary of the Invention 

The present invention relates to a new subtype of mammalian GABA B receptor 
which has been designated gb2 GABA B . In particular, the present invention relates to 
nucleic acids (DNA) encoding gb2 GABA B , to vectors and host expression systems 
containing these nucleic acids, to isolated and purified gb2 GABA B receptors, to 
isolated and recombinantly-produced antigens encoded by these nucleic acids, and to 
antibodies produced against these antigens. Using expressed sequence tag (EST) data 
base searches, polymerase chain reaction, and library screening, we have identified 
and isolated human and rat cDNAs for a novel GABA B receptors, designated hgb2 
and rgb2. with homology to rgb la and rgb lb (the two alternatively spliced forms of 
gbl). 
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This is only the second GABA,, receptor primary sequence to be identified. 
The homology between rnt receptor sequences (rgbl and rgb2) and human receptor 
sequence (hgb2 ) indicates that they might be members of a larger GABA„ receptor 
family. These receptors (i.e.. hgb2. rgb2. and rgbl) comprise a new subfamily of 

5 seven transmembrane G-protein-coupled receptors that share structure and sequence 
similarities with the metabotropic giutamate receptors. Baclofen and GABA 
modulate cAMP synthesis through the gb2 receptors at u.M concentrations. The rgb2 
mRNA exhibits a more discrete expression pattern in rat brain than rgbl and appears 
to be absent from white matter suggesting that it is predominantly localized to 

0 neurons. 

One object of the invention is to provide a nucleic acid sequence that encodes 
a protein which comprises a mammalian gb2 GABA,, receptor. A further object of the 
invention is to provide a nucleic acid sequence that encodes a protein which comprises 
a human gb2 GABA„ receptor. 

A further object of the invention is to provide constructs containing the genes 
encoding such receptors under the transcriptional and translational regulatory control 
of regulator genes recognized by a desired host to which the mammalian gb2 
GABA,, receptor genes are foreign. 

A still further object of the invention is to provide an isolated or pure clone 
from amammalian tissue DNA library, wherein the clone contains a DNA sequence 
that encodes a mammalian gb2 GABA„ receptor. Another object of the invention is to 
provide an isolated or pure clone from a human tissue DNA library, wherein the clone 
contains a DNA sequence that encodes a human gb2 GABA B receptor. 

Another object of the invention is to provide an expression vector that contains 
the DNA of the invention and is capable of transforming a host cell. 

Still another object of the invention is to provide host cells and have been 
transformed with such DNA and which express a mammalian gb2 GABA Q receptor. 

Another object of the invention is to provide isolated or purified proteins 
comprising mammalian gb2 GABA n receptors. Still another object of the invention is 
to provide an isolated or purified protein comprising human gb2 GABA a receptor. . 

Yet another object of the invention is to provide methods for obtaining a DNA 
sequence encoding a mammalian gb2 GABA B receptor, methods for preparing a cell 
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thai expresses the receptor, and methods for using such cells in the pharmacological, 
physiological, functional, or other investigational analysis of GABA. GABA,» 
receptors, and agonists or antagonists for GABA n receptors. 

To achieve the objects and in accordance with the purpose of the invention, as 

5 embodied and broadly described herein, isolated or essentially pure DNA sequences 
encoding mammalian gb2 GABA B receptors are disclosed herein. Preferably, the 
DNA sequences are cDNA sequences, and most preferably they are the cDNA 
sequences shown in SEQ. ID NO.: 1 (human) and SEQ. ID NO.: 2 (rat) or allelic 
variants thereof. Preferably the mammalian gb2 GABA B receptors are hgb2 GABA B 

0 receptors as defined by SEQ. ID NO.: 3 (human) and rgb2 GABA e receptors as 
defined by SEQ. ID NO.: 4 (rat)) or allelic variants thereof. 

Other objects and advantages of the present invention will be apparent from a 
consideration of the specification. 

5 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 provides schematic diagrams of hgb2 (panel A), rgb2 (panel B). 
rgblb (panel C), and rgbla (panel D) sequences and shows the relative location of 
oligonucleotide primers and restriction enzyme sites used in constructing the cDNAs 
and the location of oligonucleotide primers used to make riboprobes for in situ 
hybridization histochemistry. Left facing arrows represent sense primers, right facing 
arrows represent antisense primers. The cloning of hgb2 is detailed in Example 1 ; the 
cloning of rgb2 is detailed in Example 2: the cloning of rgb lb and rgb la is detailed 
in Example 3. 

Figure 2 illustrates the alignment of human gb2 and rat gb2 with rat gbla and 
gblb receptor amino acid sequences. Dots indicate residues identical in all receptors. 
Hydrophobic domains are denoted by bars and labeled with the transmembrane 
domain (TM) numbers. Potential N-iinked glycosylation sites are designated by an 
asterisk over the appropriate residues. 

Figure 3 illustrates hgb2 (panel A) and rgbla (panel B) receptor-mediated 
modulation of forskolin-stimulated cAMP synthesis in hgb2/293-I0 and rgbIa/293-18 
cells. hgb2/293-t() and rgb I a/293- 18 cells stably expressing the hgb2 and rgbla 
receptors, respectively, were generated and assayed for modulation of forskolin- 
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stimulated cAMP synthesis as described in Example X. Data are presented as the 
percent of total cAMP synthesized in the presence of forskolin only. Error bars 
represent the standard error of the mean for quadruplicate samples. The data 
presented are from a single representative experiment that has been replicated twice. 

Figure 4 illustrates hgb2 mRNA expression in CNS and peripheral tissues as 
determined by Northern blot analysis. Ciontech multiple human tissue Northerns 
were hybridized with a probe directed against the hgb2 receptor. Weak bands were 
observed in heart and spinal cord human mRNA. 

Figure 5 illustrates the localization of rgb2 and rgbl receptor mRNA by in situ 
hybridization histochemistry. Coronal 12 urn thin sections of a rat forebrain are 
shown. Brightfield (A and C) and darkfield (B and D) illumination is demonstrated of 
the identical visual fields; A and B show rgbl and C and D show the rgb2 receptor 
mRNA. Note the lack of grains over the caudate nucleus and the septum in C and D. 
Abbreviations: c = caudate nucleus; cx = cerebral cortex: PO = preoptic area; s = 
septum. The scale bar represents 1 mm. 

Figure 6 illustrates the localization of rgb2 and rgbl receptor mRNA by in situ 
hybridization histochemistry. Coronal 12 urn thin sections of a rat forebrain are 
shown. Brightfield (A and C) and darkfield (B and D) illumination is demonstrated of 
the identical visual fields; A and B show the rgbl and C and D show the rgb2 receptor 
mRNA. Note the lack of grains over the hypothalamus in C and D. Abbreviations- 
Hi « hippocampus: Hth = hypothalamus: Th - thalamus. The scale bar represents i 
mm. 



DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention relates to a new subtype of mammalian GABA 8 receptor 
which has been designated gb2 GABA 3 . In particular, the present invention relates to 
nucleic acids (DNA) encoding gb2 GABA B receptors, to vectors and host expression 
systems containing these nucleic acids, to isolated and purified gb2 GABA H receptors, 
to isolated and recombinantly-produced antigens encoded by these nucleic acids, and 
to antibodies produced against these antigens. Using expressed sequence tag (EST) 
data base searches, polymerase chain reaction, and library screening, we have 
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identified and isolated human and rat cDNAs lor novel mammalian GABA,, receptors, 
designated hgb2 and rgb2. with homology to gb la and gb lb. 

The DNA of the invention is an isolated or essentially pure DNA sequence 
(i.e., polydeoxyribonucleotide) encoding a protein which comprises the new 
5 mammalian subtype gb2 GABA„ receptors. As used herein, the term "isolated" and 
variations thereof means that the DNA is in isolation from DNA encoding proteins 
normally accompanying these receptors or encoding a different GABA receptor. Thus, 
the DNA of the invention includes DNA encoding the gb2 GABA B receptor when that 
DNA has been cloned into a bacteria] vector, such as a plasmid, or into a viral vector 
1 0 that may be harbored by a bacteriophage, provided that such clones are isolated from 
clones that contain DNA encoding other proteins normally accompanying such 
receptors or encoding a different GABA receptor. As used herein, the term 
"essentially pure" and variants thereof means the DNA is substantially free of DNA 
and RNA that does not encode the gb2 GABA B receptor. That is, there will be no 
1 5 more than about 0. 1 percent of other DNA and RNA and preferably no more than 
about 0.01 percent of other DNA and RNA in any sample that contains the DNA 
encoding the gb2 GABA„ receptor. 

Preferably, the DNAof the invention is complementary DNA (cDNA). One 
preferred human cDNA has the nucleotide sequence as defined in SEQ. ID No.: 1 ; 
20 another preferred rat cDNA has the nucleotide sequence as defined in SEQ. ID No.:2. 
The cDNA is approximately 5.5 to 5.8 kilobases in length. The preferred human and 
rat cDNA code for proteins with the deduced amino acid sequence of SEQ. ID NO.:3 
and SEQ. ID NO.:4, respectively. These proteins contain seven transmembrane 
regions (TM1 through TM7 as shown in Figure 2B). As those skilled in the art will 
25 realize, allelic variations of the DNA and protein which do not significantly affect the 
functionality of the DNA and/or protein can also been employed and considered 
within the invention described herein. For purposes of this invention, an "allelic 
variation" of a given DNA and/or protein retains at least about 95 percent of the 
sequence of the comparison molecule (DNA or protein) and maintains the essential 
30 functionality of the comparison molecule. For the DNA of this invention, the 

essential functionality is encoding a mammalian GABA„ receptor of subtype gb2; for 
the protein of this invention, the essential functionality relates to its action as a 
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GABA,, receptor. Thus, such allelic variations may involve single or multiple 
mutations so long as the functionality remains intact. 

The DNA of the invention may be obtained by various methods, as detailed in 
the examples, involving known molecular biology techniques. The present invention 
also encompasses isolated or purified clones from mammalian tissue DNA library, 
which clone contains a DNA molecule encoding a mammalian gb2 CABA B receptor. 
Preferably, the clones comprise an essentially pure culture of bacteriophage 
containing the cDNA of SEQ. ID. NO, 1 or SEQ. ID. NO,2 inserted into the genome 
of the phage. Alternatively, the clone comprises an essentially pure culture of 
bacteria, such as £. coli, containing the cDNA of SEQ. ID. NO.: 1 or SEQ. ID. NO.:2 
inserted into a plasmid (e.g., pUC-18, pBSSK II (-), pcDNA/amp, pcDNA3.1, and the 
like) in the bacteria. Cosmids. BACE YAC vectors, and X phages could also be used 
as expression systems for these genes. 

The DNA of the invention can be used to transform procaryotic cells, such as 
bacteria, fungi, or other microorganisms, or transfect eucaryotic cells, such as yeast or, 
mammalian cells. Transformation or transfection is accomplished by techniques 
known in the molecular biology. The DNA of the invention maybe joined to a wide 
variety of other DNA sequences for introduction into an appropriate host cell. The 
companion DNA will depend upon the nature of the host, the manner of introduction 
of the DNA into the host, and whether episomai maintenance or integration is desired. 
Mammalian cells are the preferred host. 

Generally, DNA is inserted into an expression vector, such as a plasmid, in 
proper orientation and correct reading frame for expression. If necessary, the DNA 
may be linked to the appropriate transcriptional and translational regulatory control 
nucleotide sequences recognized by the desired host, although such controls are 
generally available in the expression vector. The vector is then introduced into the 
host through standard techniques. Generally, not all of the hosts will be transformed 
or transfected by the vector. Therefore, it will be necessary to select for transformed or 
transfected host cells. One selection technique involves incorporating into the 
expression vector a DNA sequence, with any necessary control elements, that codes 
for a selectable trait in the transformed cell, such as antibiotic resistance. Alter- 
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natively, the gene lor such selectable trail can be on another vector, which is used to 
co-transform or co-transfect the desired host cell. 

The preferred expression vector for use in the invention is the plasmid 
pcD3.1hgb2. This plasmid can be prepared from the expression vector pcDNA3.1. 

After the modified expression vector has been prepared, it is transfected into a 
mammalian cell, using techniques known in the art. The mammalian cell may be any 
one that can be transfected with the DN A of the present invention. 

Preferably, the cell is one that can be stably transfected.with the DNA so that 
the DNA is integrated into the genome and expressed on an essentially continuous 
basis in the cell or its progeny. Further, the cell to be transfected is preferably one 
from a mammalian cell line that is capable of continuous growth in a suitable culture 
medium. Preferred types of cell lines include those derived from human embryonic 
kidney cells, such as HEK-293. neuronal cells, such as the type of pheochromocytoma 
cells known as PC 12 ceils, fibroblast cells, such as A9 L cells, epithelial-like cells, 
such as Chinese hamster ovary (CHO) cells, and a glial or glial-like cells, such as c6 
glioma cells. 

After transfection. the cells are evaluated by screening to identify and isolate 
those that have taken up the expression vector. Preferably, the expression vector will 
be stably inserted into the genome of the cell. If the stably transfected cells were 
originally from a continuous cell line, the cell can be clonally expanded to provide a 
cell line that is capable of continuous growth in a suitable culture medium and which 
expresses a mammalian gb2 GABA G receptor. 

A person skilled in the art, using known techniques and the teachings 
disclosed herein, can use the naturally variant or derived DNA of the invention to 
prepare sequencing vectors, expression vectors, transformed or transfected cells, and 
transfected cell lines as disclosed herein. Accordingly, these are within the scope of 
the present invention. 

The cells and cell lines of the present invention can be used in the 
development of better drugs for the possible treatment of neurological and psychiatric 
disorders wherein GABA D receptors play an important role. The DNA. clones, and 
vectors of the present invention can be used for the construction of such cells and cell 
lines. Cells and cell lines that express mammalian gb2 GABA a receptors, particularly 
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those that express human gb2 GABA B receptors, will be extremely useful in the study 
of receptor function and its implications in neurological and psychiatric disorders. 
Relative to animal models, such cell culture assays are quick, inexpensive, and highly 
standardized. In addition, the cloned DNA sequences can be used to measure the 
mRNAs that encode these receptors, thus measuring tissue specific expression. 
Moreover, receptor pharmacology can be determined in isolation from other related 
receptors, including different GABA receptor types or subtypes. Therefore, these 
assays will be tools in the effort to provide a scientific basis for the rational 
development to new drugs. 

In addition, the assays can be used to screen drugs in order to find or develop 
gb2 GABA B antagonists (substances that block the receptors) or agonists (substances 
that stimulate the receptors). Generally known techniques, modified in accordance 
with the discoveries and teachings disclosed herein, can be used with the cells and cell 
lines of the invention to evaluate potential antagonists or agonists by generating dose 
response, saturation, inhibition, or displacement curves. Initial candidates can be 
further tested for specificity, binding affinity, and activity. 

Whether or not a chemical binds to a mammalian gb2 GABA B receptor can be 
determined by contacting the chemical with a cell or a part thereof from a host cell 
transformed by the DNA of the invention and determining the ability of the cell or 
part thereof to bind to the chemical in the appropriate assay in the presence of a 
chemical known to bind to a gb2 GABA B receptor, such as GABA and baclofen. 

Gb2 GABA B receptor agonists are normally identified using both binding and 
functional assays. Such functional assays include the so-called second messenger 
assays. When neurotransmitters bind to receptors on the surface of cells, the 
metabolism of the various molecules is altered. When these molecules effect the 
physiology of the cell, they are referred to as second messengers. Two second 
messenger pathways that have been well characterized are inositol phosphate and 
cyclic adenosine monophosphate (cAMP) metabolism. See Berridge, Annu. Rev. 
Biochem. 56, 159 (1987) and Gilman, Annu. Rev. Bioahem. S6, 615 (1987). cAMP 
metabolism can be examined by various methods. The activity of the enzyme 
adenylate cyclase or the levels of cAMP can be measured. Thus, the invention 
provides an in vitro method for evaluating a chemical to determine if it is a gb2 
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GABA B receptor agonist. The chemical is contacted with a cell transformed by the 
DNA of the invention. The effect of the chemical on a second messenger pathway of 
the cells is then measured using conventional techniques. 

Cells and cell lines expressing mammalian gb2 GABA B receptors can be used 
to provide a sufficient amount of the proteins for the preparation of antibodies using 
conventional techniques. The antibodies can be used to define more clearly the 
localization of receptor types than is possible using in situ hybridization to mRNA. 
They could also be used for diagnostic purposes if clinically significant abnormalities 
were found in association with the gb2 GABA B receptors. Moreover, they would 
permit purification of the receptor proteins by affinity chromatography. Highly 
purified gb2 GABA B would permit the 3-dimensional protein to be studied by X-ray 
crystallography. Such information would be especially useful in rational drug design. 

The molecular biology techniques used in making and using the present 
invention are generally known in the art. For example, suitable techniques and 
procedures arc generally described in "Molecular Cloning: A Laboratory Manual," 2 1 " 
Ed.. Sambrook. Fritsch, and Maniatis, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY, 1989; and "Current Protocols in Molecular Biology," Ausubel et al., 
John Wiley and Sons, N.Y., 1987 (updated quarterly). 

Hgb2 and rgb2 Amino Acid Sequences . Through EST data base searches, 
polymerase chain reaction, and library screening as described in the Examples, human 
and rat homologues of a novel receptor (hgb2 in human and rgb2 in rat) have been 
identified and isolated. These receptors have homology to the GABA n receptors 
(rgbla and rgblb from rat) identified by ftaupmann et al. (Nature 386, 239-246 
(1997)). Figure 1 shows a schematic of the hgb2, rgb2, rgbla and rgblb receptor 
nucleotide sequences denoting the placement of oligonucleotide primers and 
restriction enzyme sites used to generate each of these constructs and the expression 
plasmids. Hgb2 receptor fragments were amplified from human brain cDNA while 
rgb2 was isolated from a rat cortex cDNA library followed by amplification of rat 
brain cDNA to obtain the 5' end of the receptor including the start site. The resulting 
5786 base pair sequence generated for hgb2 includes an open reading frame of 2826 
bases. The combined 5' RACE (i.e.. rapid amplification of cDNA ends) product and 
insert for rgb2 spans 5459 base pairs and includes an open reading frame of 2823 
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bases. The cDNA sequences lor hgb2 and rgb2 predict proteins of 942 and 941 amino 
acid, respectively, with relative molecular mass of 104 kD. 

Sequence similarity and hydropathy profiles (Kyte et a!.. J. Mol. Biol. 157. 
105-132 (1982) of the predicted proteins demonstrate that they share structural 
i features with the GABA 0 receptor rgb I (Kaupmann et al.. Nature 386. 239-246 

(1997)). the parathyroid cell calcium-sensing receptor, and the family of metabotropic 
glutamate receptors (for review see Pin and Duvoisiru Neuropharmacol. 34, 1-26 
(1995)). Significant features in the hgb2 and rgb2 sequences include 7 putative 
transmembrane regions, a large extracellular amino terminus of 476 residues, and 4 
potential sites for N-Iinked glycosylate (Figure 2). There is an abundance of proline 
residues in the amino terminus of both hgb2 and rgb2, with a string of 9 proline 
residues only 9 residues into the sequences. 

Comparison of the amino acid sequences for hgb2 and rgb2 with that for rgbla 
and rgb 1 b (Figure 2) shows 28% amino acid identity which rises to 38% when - 
conservative amino acid substitutions are taken into account. Homology between the 
gb2 and gbl receptors is not limited to the hydrophobic regions but instead spans the 
entire protein. Search of Genbank indicated that hgb2 and rgb2 share significant - 
sequence homology with the metabotropic glutamate receptors. The Genbank search 
also showed a limited degree of homology between the amino acid sequence of hgb2 
and rgb2 and the human and rat calcium-sensing receptors. 

Hgb2 Encode s a GAK An Receptor . A functional screen of HEK293 cells 
stably transfected with pcD3. Ihgb2 or pcD3.lrgbla revealed several cell lines 
expressing these receptors to varying degrees. Preliminary data is shown in Figure 3 
for the stable cell lines hgb2/293-10 and rgbla/293-1 8. Adenylyl cyclase was 
stimulated as described in the Example 8 with 10 uM forskolin in the presence or 
absence of 30 uM baclofen or 30 uM GAB A. 1 00 uM AOAA (a GABA transaminase 
inhibitor), and 100 uM nipecotic acid (a GABA uptake blocker). hgb2/293-I0 cells 
exhibited a 65% reduction in forskolin-stimulated cAMP synthesis with baclofen and 
a 50% decrease with GABA plus AOAA and nipecotic acid (Figure 3). A 40% 
reduction in forskolin-stimulated cAMP synthesis was observed for reb la/293- 18 
cells treated with baclofen and a 45% decrease was observed for GABA plus AOAA 
and nipecotic acid (Figure 3). Neither baclofen nor GABA plus AOAA and nipecotic 
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acid in the absence of Ibrskblin had any affect on cAMP synthesis ( i-igure 3 ). These 
data suggest that the hgb2 receptor, like the rgbla and rgbl b receptors, is a GABA n 
receptor since the criteria for baclofen sensitivity has been established. Kaupmann et 
al. (Nature 386. 239-246 ( 1 997)) observed similar reduction in forskolin-stimulated 
5 cAMP synthesis with HEK293 cells stably expressing the rgbla receptor. However, it 
is worth noting that it took 300 uM baclofen to observe a 30% decrease in forskolin- 
stimulated cAMP synthesis as compared to the 30 uM used in the current study which 
accomplished a 40% reduction. These differences may be due to differences in the 
levels of receptor expression between the rgbla stable ceils and rgbl a/293-1 8 cells. 
0 Acquisition of a radiolabeled antagonist for binding analyses will be necessary before 
differences in receptor cell number may be addressed. Work is underway to further 
characterize the pharmacology of these receptors and to establish a rank order of 
potency for GAB A B receptor agonists and antagonists. 

' Rgb2 Expression in Rat CIVS and PeHnh.r.1 T Analysis of Northern 

blots of human mRNA from several brain regions and peripheral tissues with a probe 
directed against the hgb2 receptor reveals a single mRNA species of 6.2 kb (Figure 4). 
Hgb2 receptor mRNA is expressed in all of the brain areas tested although the level of 
expression seems to vary significantly with the highest being found in the cortex and 
thalamus and the lowest in the corpus callosum, caudate nucleus, and medulla. In 
addition, a weak 6.2 kb band is detected in spinal cord mRNA. No hgb2 receptor 
mRNA is detected in any of the peripheral tissues examined with the exception of the 
heart where a weak band is observed. 

In situ hybridization histochemistry was carried out using sense and antisense 
probes directed against both the rgb2 and rgbl receptors for comparison as described 
in the Examples. The rgbl receptor probe is directed against a region common to both 
the rgbla and rgb 1 b receptors (see Examples and Figure 1 ) therefore mRNA for both 
receptors will be hybridized. Rgbl receptor mRNA is heterogeneously expressed in 
abundance throughout all regions of the rat brain as previously described (Kaupmann 
et al.. Nature 386, 239-246 ( 1 997)). Rgb2 receptor mRNA is also abundant in rat 
brain, but its distribution is somewhat more restricted. There are significantly lower 
levels of rgb2 receptor mRNA hybridization in the caudate-putamen. septum, medial 
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basal hypothalamus, and brainstem of the rat brain as compared with levels of rab I 
receptor mRNA (Figures 5 and 6). The levels and distribution olrgb2 and rgbl 
receptor mRNA hybridization in all other brain regions examined appeared to be 
nearly identical. In contrast to rgbl receptor mRNA. hybridization to rgb2 receptor 
mRNA was not detected in white matter, suggesting that expression of rgb2 receptor 
mRNA is restricted to neurons. Kaupmann et al. (Nature 386, 239-246 (1997)) 
showed that rgbl mRNA is expressed in neurons but not glia in the CA3 field of the 
hippocampus. Detailed examination of other brain regions has revealed that rgbl 
mRNA is found in glia as well. While hybridization to rgb2 mRNA has not been 
detected in glia. more detailed studies of rgb2 mRNA expression and protein 
localization at the cellular level are necessary before this possibility can be ruled out. 

While it is difficult to make a direct extrapolation from mRNA hybridization 
to protein location, the data presented herein are generally in agreement with 
previously published binding data (Chu et al., Neuroscience 34. 341-357 (1990): 
Bowery et al.. Neuroscience 20, 365-383 (1987)). Binding of GABA in certain 
regions of the rat brain including the septal area, caudate putamen. hypothalamus and 
brainstem, with the exception of the interpeduncular nucleus, is relatively low (Chu et 
al.. Neuroscience 34, 341-357 (1990); Bowery et al.. Neuroscience 20. 365-383 
(1987)) in agreement with the levels of hybridization which were detected for rgb2. 

The following examples are intended to illustrate the invention and not to limit 
the invention. All references cited in the present specification are hereby incorporated 
by reference. 



25 



Example 1 - Cloning nf h ff IO a tblastn search of the Genbank EST 
database using the amino acid sequence of the rat gbl receptor as query detected three 
ESTs (R76139. R80448, and Z43654) which showed distant homology to 
transmembrane domains 5-7 and another EST (T0726 1 ) which showed similarity to 
transmembrane domains 1 -3. Primers designed from these sequences (JCI 36 
30 gcgggatccCTTCGGCACGAATACCAGGCAGAGGGT for 5' extension; JCI 37 
ccggaattcC AGAAGCTCATAAAGATGTCGAGTCC and JCI 35 
ccggaattcTTCTTAC.CTTTGGGAGACCCGCAACGT for 3' extension) were used 
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with the Clontech API primer to do 5' and 3' RACE using Clontech Marathon-Ready 
human brain cDNA as template and touchdown PCR conditions (94"C. 30 sec and 
72°C. 4 min for 5 cycles: 94°C. 30 sec and 70°C. 4 min for 5 cycles: 94°C. 30 sec 
and 68°C. 4 min for 20 cycles). JC135 and JC136 primers were derived from the 
transmembrane domains 5-7 ESTs while JC137 was derived from T07621. JCI36 
and JC137 primers were used together to generate a product of 722 bases which, when 
sequenced, confirmed that the ESTs were derived from the same mRNA. 3' products 
from RACE with JC135 and JC137 were reamplified with the Clontech AP2 primer 
yielding two products of 1 .2 and 1 .7 kb is size but lacking a poIy(A) signal. 

A tblastn search of the Genbank EST database with the 3' sequences detected 
an EST (AA323988) which overlapped with the query sequence and extended the 
sequence through an A-rich region. Gb2-15 

(AGCACTAGAACTCCAGCTGGAAGTCA) was designed based on EST 
AA323988 and used in a 3' RACE reaction as described above to yield a 2.3 kb band 
that when sequenced was shown to include a poly(A) signal. 5' RACE with JC 136 
yielded a 0.88 kb Not 1/BamH 1 fragment which when sequenced showed weak amino 
acid homology to rgbl. JC153 (GCCTACGATGGCATCTGGGTCATCGC) was 
designed based on the JC1 36 5' RACE product sequence and paired with JCI36 to 
yield a 1.1 kb fragment. Further 5' extensions were obtained by using Clontech 
Advantage-GC PCR (1 ul GC-melt per 50 uJ reaction^ PCR conditions: 94°C, 15 sec 
and 68°C. 3.5 min for 30 cycles) and nested primers JC152 
(CAGATTCCAGCCTTGGAGGGACTCTGC) followed by JC186 

(TGCATCGTAGAAGGCTTTCAACCCTTTTGC) and adaptor primers API 
followed by AP2. 

25 pcDhgb2I was generated by subcloning a Nae i/Xho I fragment from a 

JC153/JC165 (JC165: AATAAGGCTCGAGGTCAGGTGCCAA) 2.2 kb PCR 
product and an EcoR I/Nae I fragment from a JC171/JC154 (JC 171 : 
CAGCAGCCCGCCGCTCTCCATCAT; JC154: 

gcgcgtaatacgactcactatagggGTAGTTGAAGTCCTGGATCC G) 1 kb PCR product 
30 into EcoR I/Xhol digested pcDNAl/amp (Invitrogen). The expression construct 

pcDhgb2II was constructed by subcloning a Not I/Xmn I 5' RACE fragment obtained 
with AP2/JC186 into a Not I/Xmn 1 digested pcDhgb2I. PcD3.1hgb2 was made by 
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subcloning an EcoR l/Xba 1 fragment from pcDhgb2 into EcoR l/Xba I digested 
pcDNA3.1 (Invitrogen). 



Ex*mp\*2- Cloning of rg h2 . Rgb2 was obtained by screening pools of 
clones from a rat cortex cDNA library using Southern blot analysis." DNA was 
prepared from each of 49 pools representing an average of 5 x 10' clones per pool and 
digested with Xho I to release the inserts from the vector DNA. 5 ug of Xho I 
digested pool DNA was run on a 0.8 % agarose gel and transferred to a nitrocellulose 
supported membrane (BA-S 85. Schleicher & Schuell) using a TurboBlotter 
(Schleicher & Schuell). Nitrocellulose membranes were hybridized with a "P-labeled 
random primed (Boerhinger Mannheim) hgb2 JC137/JC136 (JC137: 
ccggaattcCAGAAGCTCATAAAGATGTCGAGTCC: JC 1 36: 
gcgggatccCTTCGGCACGAATACCAGGCAGAGGGT) PCR fragment in 3x 
Denhardts and 3x SSC at 60"C. Hybridization was followed by two 15 min washes 
in 3x SSC at 60°C. Blots were imaged on the Fuji* Phosphorlmaging system (Fuji). 
Bacteria from two pools was each used to make 22 subpools. DNA was prepared 
from these subpools and analyzed as described above. Four positive subpools (101, 
10M. 370. and 37Y) were analyzed further in subpools representing 3 x I0 J clones 
per subpool. Nitrocellulose filter lifts were made of successive platings of subpools 
1017 and 3709 and processed following standard protocols ("Molecular Cloning: A 
Laboratory Manual," 2- Ed.. Sambrook. Fritsch. and Maniatis. Cold Spring Harbor 
Laboratory, Cold Spring Harbor. NY, 1989; and "Current Protocols in Molecular 
Biology," Ausubel et al. f John Wiley and Sons, N.Y.. 1987 (updated quarterly)). 
Filters were hybridized as described above followed by two washes with 3x SSC at 
60 a C for 15 min and 1 wash in 3X SSC at 60°C for 1 nr. Filters were exposed to 
BioMax MR film (Kodak) for 16 nr. 

Two positive overlapping clones were isolated from this screen, 10175a and 
37094a. The 5200 kb clone 37094a was sequenced and found to lack a start site. 
RACE was performed using the high GC conditions described above with nested 
primers JC 1 90 (CGCTGGTGCAGGGATCATTGGAGAA) and JC 1 91 
(CATATTCTGGTCAAACTGGCCAAGG) paired with API followed by AP2 using 
Clontech Marathon-Ready rat brain cDNA as template to yield the 5' end of the rgb2 
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receptor. A Xho I/Bgl II fragment from the 5' RACE olrgb2 and a Bgl II fragment 
from 37094a were subcloncd into Xlio I/BamH I digested pBSSK. II (-). The corrct 
orientation of the Bgl II fragment was established using restriction enzyme mapping 
with Xhol and BamH I. PcD3. 1 rgb2 was made by subcloning a Kpn I/BamH I 
fragment of the pBSSK II (-) plasmid with a BamH I/Xba I fragment from 37094a " 
into Kpn I/Xba I digested pcDNA3.1 . 

Example 3 - Cloning of rgblh and rghln Rgbib was obtained by 
amplification of Clontech rat brain Marathon-Ready cDNA with JC140 
(ccggaattcccaccATGGGCCCGGGGGGACCCTGTACC) and JC 1 41 
(CTAGTCTAGATCACTTGTAAAGCAAATGTACTCGACT). pBSrgblb was 
made by ligation of an EcoR I/Xba I fragment of rgbl b into EcoR I/Xba I prepared 
pBSSKII(-). pcDrgblb was constructed by subcloning an EcoR I/Not I fragment from 
pBSrgblb into EcoR I/Not i prepared pcDNAl/amp (Invitrogen). Rgbla 5' sequence 
was obtained by 5' RACE with API or 2 paired with JC 144 (GAGTAG 
TTCGTACAAGTACTTGGTGGC) using Clontech rat brain Marathon-Ready cDNA 
as template. An rgbla 5' fragment for vector construction was generated by 
amplification of the JC144 RACE product with JC139 

(ccggaattcccaccATGCTGCTGCTGCTGCTGGTGCCTCTC) and JC 1 44. pBSrgb 1 a 
was made by ligation of an EcoR I/Hpa I fragment from the JC139/JCI44 PCR 
product into EcoR I/Hpa I digested pBSrgblb. pcDrgbla was constructed by 
subcloning an EcoR I/BstE II fragment from the pBSrgbla into EcoR I/BstE II 
digested pcDrgblb. An EcoR I/Not I fragment from pcDrgbla was subcloned into 
pcDNA3.1 (Invitrogen) to make pcD3.1 rgbla. 

Example 4 - Northern Analysis Clontech multiple human tissue Northerns 
were hybridized with the 0.88 kb Not I/BamH I fragment (bases 732-1613 of hgb2) at 
42°C in 5X SSPE. 10X Denhardfs, 50% formamide. 2% SDS. 100 ug/ml denatured 
salmon sperm DNA. Following hybridization blots were washed three times for 10 
min each in 0.2X SSPE. 1% SDS at 60°C. Blots were imaged on the Fujix 
Phosphorlmaging system (Fuji). 
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Example 5 - In Situ Hybridisation Histochemistry . Preparation of rat 
brain sections, prehybridization and hybridization of rat brain slices was performed 
using conventional techniques {sec, for example. Bradley et al.. J. Neurosci. 12, 2288- 
2302 (1992); a general protocol can also be found on the following website: 
5 http://intramural.nimh.nih.gov/lcmr/snge/ProtocoLhtml). Sections were hybridized 
with labeled antisense and sense riboprobes directed against rgb2 or rgbl . Probes 
were generated by amplification of rgb2 with JC216 paired with JC21 7 or with JC218 
paired with JC219 (JC216: 

cgcgcaattaaccctcactaaaggACAACAGCAAACGTTCAGGC; JC21 7: 
10 gcgcgtaatacgactcactatagggCATGCCTATGATGGTGAG; JC21 8: 
cgcgcaattaaccctcactaaaggCTGAGGACAAACCCTGACGC; JC2 19: 
gcgcgtaatacgactcactatagggGATGTCTTCTATGGGGTC) or by amplification of rgbl 
with JC 160 paired with JC1 61 (JC160: 

cgcgcaattaaccctcactaaaggAAGCTTATCCACCACGAC; JCI61 : 
1 5 gcgcgiaatacgactcactatagggAGCTGGATCCG AGAAGA A) and labeled as described 
previously (Bradley et al., J. Neurosci. 12, 2288-2302 (1992); the general protocol can 
also be found at http://intramural.nimh.nih.gov/lcmr/snge/Protocol.html). 

Example 6 — Cell Culture and Stable Expression of hgb2 and rgbl a . 

20 HEK.-293 cells, human embryonic kidney cells, were cultured in DMEM 

supplemented with 10% fetal calf serum (Sigma), 1% penicillin/streptomycin 
(GIBCO/BRL), at 37°C in 5% CO : . HEK-293 cells grown to 75% confluency in 10- 
cm tissue culture plates were used for control, pcD3. Ihgb2 and pcD3.1rgbla 
transfections. HEK-293 cells were exposed to either a solution of cationic iipid/DNA 

25 (Lipofect AMINE; GIBCO/BRL) in DMEM without fetal calf serum or 
penicillin/streptomycin (70 \xg LipofectAMINE/10 |xg of pcD3.1hgb2 or 
pcD3.1rgbla) or a solution containing cationic lipid alone (70 |ig LipofectAMINE) in 
DMEM, for 5 hr at 37°C. After incubation, lipid/DNA solutions were supplemented 
with equal volumes of DMEM with 20% fetal calf serum and cells were set at 37°C 

30 overnight. Within 24 hr of transfection media was removed and replaced with serum- 
and penicillin/streptomycin-supplemented DMEM. Transfected cells were allowed to 
recover for 24 hr after which each 10-cm plate was split to five 10-cm plates in 
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DMEM supplemented with !()% fetal calfserum. 1% penicillin/streptomycin and 375 
Mg/ml geneticin (G4I 3; and GIBCO/BRL) for a period of 2-4 weeks. Resistant 
colonies were isolated and transferred to 24-well plates. Those cells thai survived the 
isolation procedure were cultured for preparation of total RNA for dot blot analysis 
(Example 7) and for cAMP response assays (Example 8) to screen for expression of 
hgb2 and rgbl a. 

Example 7 - Detection of hgb2 and rp h la RNA hv Hot Blot Analy ^y The 
cells from Example 6 were grown in 6- well plates to 75-100% confluency. Total 
RNA was prepared using the protocol provided with Trizol (GIBCO/BRL). Briefly, 
Trizoi was added to each well and cells were triturated and transferred to 1.5 ml 
mierofuge tubes. Samples were extracted with chloroform followed by precipitation 
with 2-propanol. Pellets were resuspended in 7.4% formaldehyde/6X SSC and stored 
at -20°C until analysis. Half of each sample was diluted in equal volume of 7.4% 
formaldehyde/6X SSC. denatured and applied to a supported nitrocellulose membrane 
using a dot blot apparatus. RNA was fixed to the membrane by UV crosslinking 
(Stratalinker. Stratagenc) and blots were prehybridized in 6X SSC. 5X Denhardt's, 
0.5% SDS. 50% formamide and 100 fag/mi denatured salmon sperm DNA at 42 T for 
at least 2 hr. Blots were hybridized with 32 P-Iabeled random primed PCR fragments 
for hgb.2 and rgbl a (hgb2: JC137/JC136; rgbl a: JC160/JC161), 1-5 x 10 6 cpm/ml. 
were denatured and added directly to the prehybridized solution. Blots were 
hybridized at 42°C overnight followed by one 20 min wash at 25°C with 2X SSC, 
0.1% SDS and two 30 min washes at 68°C with 0.2X SSC. 0.1% SDS. Blots were 
imaged on the Fujix Phosporlmaging system. Data tabulation is underway. 

Example 8 - Determ ination of Cellular cAMP Response HEK-293 cells 
from Example 6 which stably express the hgb2 and rgbl a receptors were lifted in IX 
PBS, 2.5 mM EDTA, counted, pelleted and resuspended at 1.5 x 10 5 ceils per 100 ul 
in Krebs-Ringer-Hepes medium, 100 juM Ro 20-1724 (RBI; Natick. MA) and 
incubated at 37°C for 20 min. 100 u.1 ceils was added to 100 fil of prewarmed (37°C. 
1 0 min) Krebs-Ringer-Hepes medium. 1 00 yiM Ro 20- 1 724 without or with agonists, 
antagonists and/or forskolin. Following a 20 min incubation at 37°C the assay was 
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terminated by setting the cells on ice and centriiuginy at 2000 rpm for 5 min at 4°C. 
1 75 ul of assay solution was removed and replaced with 1 75 ul of 0. 1 N hydrochloric 
acid. 0. 1 mM calcium chloride and cells were set on ice for 30 min and then stored at - 
20°C. cAMP determinations were made using a solid phase modification (Maidment 
5 et al., Neurosci. 33. 549-557 ( 1 989)) of the cAMP radioimmunoassay described by 
Brooker et al.. Adv. Cyclic Nucleotide Res. 10, 1-33 (1979). Immulon II removawells 
(Dynatech; Chantilly, VA ) were coated overnight with 100 .ul of protein G (1 mg/ml 
in 0.1M NaHCOj, pH 9.0) at 4°C. Prior to use protein G-coated plates were rinsed 
with PBS-gelatin-Tween (phosphate buffered saline containing 0.1% gelatin, 0.2% 
0 Tween.20) 3 times quickly and then once for 30 minutes. Following the rinse with 
PBS-gelatin-Tween the RJA was set up by adding 100 ul 50 mM sodium acetate, P H 
4.75. cAMP standards or aliquots from treated cells, 5,000-7.000 cpm l25 [-succinyl 
cAMP, and 25 ul of a sheep antibody to cAMP diluted in 50 mM sodium acetate, pH 
4.75 (Atto instruments: dilution of stock to 2.5x10 *, determined empirically) to the 
plates in a final volume of 1 75 ul. Plates were incubated 2 hr at 37 C C or overnight at 
4°C, rinsed 3 times with sodium acetate buffer, blotted dry, and then individual wells 
broken off and bound radioactivity determined in a gamma counter. EC 50 values were 
determined by nonlinear least squares fit of the data to the equation 

Y = A 0 + (A rn *Ly(EC S() + L) 



where Y is the amount of cCMP (picomole/well), A„ is the basal level of cAMP, A„, is 
the maximal accumulation under these conditions, L is the added ligand 
concentration, and EC 50 the apparent half-maximal concentration for stimulation of 
cAMP accumulation. Data tabulation is underway. 
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SEQUENCE LISTING 

( I) GENERAL INFORMATION: 

(i) APPLICANT: Clark. J. and Bonner, I\ 

(ii) TITLE OF INVENTION: Mammalian gb2 GAB A» 

(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fitch, Even, Tabin & Flannery 

(B) STREET: 135 South LaSalle Street, Suite 900 

(C) CITY: Chicago 

(D) STATE: IL 

(E) COUNTRY: USA 

(F) ZIP: 60603-4277 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: None 

(B) FILING DATE: 

(viii) ATTORNEY/ A GENT INFORMATION: 

(A) NAME: Kaba, Richard A 

(B) REGISTRATION NUMBER: 30.562 

(C) REFERENCE/DOCKET NUMBER: 62527 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 312-372-7842 

(B) TELEFAX: 312-372-7848 

(C) TELEX: 

(2) INFORMATION FOR SEQ ID NO: I : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5786 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: I : 

GCCGCCCCTG AGCAGCCTCG CCTTCGCCTC CCGCGTTTCC TGCCGTCCGC CCTCCCCCGG 60 

CCGAGCTCCA GGGGCTGCCG CCTAGCAGCT CCCGGCGGGA GAGCGGTTCA GACCGCGCAC 120 

GGGGGCGGGC GGAGGCGGCC CGGTGCGGGG CGGCCGCGCT GGAGAGAGGC GCGCGCGGAG 180 

ACGCCGGCCC CCCTCTCCGC GTTTGCTCGC TTGCTCCCCG CCTCCCCCAC TCCGCTCGCT 2 40 

CCCACCCCTT CCCGGCGTGA TTGATCCGTC ACGGGCGCCG CCGCTGCCGC CGCCGCCGCG 300 

GCCGTTCTGA GCCGAGCCGG AACCCTAGCC CGAGACGGAG CCGGGGCCCG GGCCGGCGCC 3 60 

ATTGCGCGGG CGCCGCGGGA AGACCTTGGC GCGGGGCGGC GGGCCGCGCC AGGCCATGCG 4 20 

GGCCGAGTGA GCCGGCGCCC GCAGCCCGCG GCGCGGCA7G GCTTCCCCGC GGAGCTCCGG 4 30 

GCAGCCCGGG CCGCCGCCGC CGCCGCCACC GCCGCCCGCG CCCCTGCTAC TGCTACTGCT 540 

GCTGCCGC7G CTGCTGCCTC TGGCGCCCGG GGCCTGGGGC TGGGCGCGGG GCGCCCCCCG 600 

GCCGCCGCCC AGCACCCCGC CGCTCTCCAT CATGGGCCTC ATGCCGCTCA CCAAGGAGGT 660 

GGCCAAGGGC AGCATCGGGC GCGGTGTGCT CCCCGCCGTG GAACTGGCCA TCGAGCAGAT 7 20 

CCGCAACGAG TCACTCCTGC GCCCCTACTT CCTCGACCTG CGGCTCTATG ACACGGAGTG 730 

CGACAACGCA AAAGGGTTGA AAGCCTTCTA CGATGCAATA AAATACGGGC CGAACCACTT 8 40 

GATGGTGTTT GGAGGCGTCT GTCCATCCGT CACATCCATC ATTGCAGAGT CCCTCCAAGG 900 

CTGGAATCTG GTGCAGCTTT CTTTTGCTGC AACCACGCCT GTTCTAGCCG ATAAGAAAAA 960 

ATACCCTTAT TTCTTTCGGA CCGTCCCATC AGACAATGCG GTGAATCCAG CCATTCTGAA 1020 

GTTGCTCAAG CACTACCACT GGAAGCGCGT GGGCACGCTG ACGCAAGACG TTCAGAGGTT 1080 

CTCTGAGGTG CGGAATGACC TGACTGGAGT TCTGTATGGC GAGGACATTG AGATTTCAGA 1140 

CACCGAGAGC TTC7CCAACG ATCCCTGTAC CAGTGTCAAA AAGCTGAAGG GGAATGATGT 1200 

GCG GAT CATC CTTGGCCAGT TTGACCAGAA TATGGCAGCA AAAGTGTTCT GTTGTGCATA 1260 

CGAGGAGAAC ATGTATGGTA GTAAATATCA GTGGATCATT CCGGGCTGGT ACGAGCCTTC 1320 

TTGGTGGGAG CAGGTGCACA CGGAAGCCAA CTCATCCCGC TGCCTCCGGA AGAATCTGCT 13 80 

TGCTGCCATG GAGGGCTACA TTGGCGTGGA TTTCGAGCCC CTGAGCTCCA AGCAGATCAA 14 40 

GACCATCTCA GGAAAGACTC CACAGCAGTA TGAGAGAGAG TACAACAACA AGCGGTCAGG 1500 

CGTGGGGCCC AGCAAGTTCC ACGGGTACGC CTACGATGGC ATCTGGGTCA TCGCCAAGAC 15 60 

ACTGCAGAGG GCCATGGAGA CACTGCATGC CAGCAGCCGG CACCAGCGGA TCCAGGACTT 1620 

CAACTACACG GACCACACGC TGGGCAGGAT CATCCTCAAT GCCATGAACG AGACCAACTT 1680 

CTTCGGGGTC ACGGGTCAAG TTGTATTCCG GAATGGGGAG AGAATGGGGA CCATTAAATT 17 40 

TACTCAATTT CAAGACAGCA GGCAGGTGAA GGTGGGAGAG TACAACGCTG TGGCCGACAC 18 00 

ACTGGAGATC ATCAATGACA CCATCAGGTT CCAAGGATCC GAACCACCAA AAGACAAGAC I8 60 

CATCATCCTG GAGCAGCTGC GGAAGATCTC CCTACCTCTC TACAGCATCC TCTCTGCCCT 1920 

CACCATCCTC GGGATGATCA TGGCCAGTGC TTTTCTCTTC TTCAACATCA AGAACCGGAA 1980 
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TCAGAAGCTC ATAAAGATGT CCAGTCCATA CATGAACAAC C77ATCATCC T7GGAGGGAT 2 04 0 

GCTCTCCTAT GCTTCCATAT 7TCTCTTTGG CCTTGATGGA TCCTTTGTCT CTGAAAAGAC C100 

CTTTGAAACA CTTTGCACCG TCAGGACCTG GATTCTCACC GTGGGCTACA CGACCGCTTT 2 ISO 

TGGGGCCATG TTTGCAAAGA CCTGGAGAGT CCACGCCATC TTCAAAAATG TGAAAATGAA 2220 

GAAGAAGATC ATCAAGGACC AGAAACTGCT TGTGATCGTG GGGGGCATGC TGCTGATCGA 2230 
CCTGTGTATC CTGATCTGCT GGCAGGCTGT GGACCCCCTG CGAAGGACAG TGGAGAAGTA ' 22 AO 

CAGCATGGAG CCGGACCCAG CAGGACGGGA TATCTCCATC CGCCCTCTCC TGGAGCACTG 2 400 

TGAGAACACC CATATGACCA TCTGGCTTGG CATCGTCTAT GCCTACAAGG GACTTCTCAT 2 4 60 

GTTGTTCCGT TGTTTCTTAG CTTGGGAGAC CCCCAACGTC AGCATCCCCG CACTCAACGA 2 520 

CAGCAAGTAC ATCGGGATGA GTGTCTACAA CGTGGGGATC ATGTGCATCA TCGGGGCCGC 258 0 

TGTCTCCTTC CTGACCCGGG ACCAGCCCAA TGTGCAGTTC TGCATCGTGG CTCTGGTCAT 2 640 

CATCTTCTGC AGCACCATCA CCCTCTGCCT GGTATTCGTG CCGAAGCTCA TCACCCTGAG 2 700 

AACAAACCCA GATGCAGCAA CGCAGAACAG GCGATTCCAG TTCACTCAGA ATCAGAAGAA 2 "7 60 

AGAAGATTCT AAAACGTCCA CCTCGGTCAC CAGTGTGAAC CAAGCCAGCA CATCCCGCCT 2820 

GGAGGGCCTA CAGTCAGAAA ACCATCGCCT GCGAATGAAG ATCACAGAGC TGG AT AAAGA 2880 

CTTGGAAGAG GTCACCATGC AGCTGCAGGA CACACCAGAA AAGACCACCT ACATTAAACA 2940 

GAACCACTAC CAAGAGCTCA ATGACATCCT CAACCTCGGA AACTTCACTG AGAGCACAGA 3000 

TGGAGGAAAG GCCATTTTAA AAAATCACCT CGATCAAAAT CCCCAGCTAC AGTGGAACAC 3060 

AACAGAGCCC TCTCCAACAT GCAAAGATCC TATAGAAGAT ATAAACTCTC CAGAACACAT 3120 

CCAGCGTCGG CTGTCCCTCC AGCTCCCCAT CCTCCACCAC GCCTACCTCC CATCCATCGG 3180 

AGGCGTGGAC GCCAGCTGTG TCAGCCCCTG CGTCAGCCCC ACCGCCAGCC CCCCCCACAG 32 4 0 

ACATGTGCCA CCCTCCTTCC GAGTCATGGT CTCGGGCCTG TAAGGGTGGG AGGCCTGGGC 3 300 

CCGGGGCCTC CCCCGTGACA GAACCACACT GGGCAGAGGG GTCTGCTGCA GAAACACTCT 3360 

CGGCTCTGGC TGCGGAGAAG CTGGGCACCA TGGCTGGCCT CTCAGGACCA CTCGGATGGC 3 42 0 

ACTCAGGTGG ACAGGACGGG GCAGGGGGAG ACTTGGCACC TGACCTCGAG CCTTATTTGT 3 4 BO 

GAAGTCCTTA TTTCTTCACA AAGAAGAGGA ACGGAAATGG GACGTCTTCC TTAACATCTG 3 54 0 

CAAACAAGGA GGCGCTGGGA TATCAAACTT GCAAAAAAAA AAAAAAAACA AAACAAAAAA 3600 

AAAAACTAGA CAAGGAGAGA GGCACTAGAA CTCCAGCTGG AAGTCACGGA GTGGCTCGAG 3660 

CAGCCTTGGG AAGAGGCAAG GAGCTTCTGA AGAAACTGCC TCTGCACACA CATCACTGGC 3720 

TGTGACCCCT CAGGCTAGCC CTTCTCCACT CTGGGGGAGG AGGTGGGAAG GGCCACCAGG 37 80 

CCCCCAGCTG CCAGGCCAGC TGACCCCAGC CTTCCTGGAA CAGGGAGTCT GCAGGAGCGC 3840 

AG AC AG GC AC AGCCCTGGAG CAGGCAGGCC GAGGGCTGCG GCACTGGAGC AGGCTGACTT 3900 

ACATGCTCCA CATGGGACCT GTGTCACCCA ATGAGATGTT TCTTACTCTG GTAAATGCCA 3960 

CACGTTAACA CAATAACACC CATTCCTGGG ACCGTGGGGA TTTAGGGCAC GTCACTGCAG 4 020 

ACACGCTCTG CAGCATTCAC CGACAGTCTG TCATGCACCC ACCACGTTGG CCACGTCCTT 4 080 

GTGTTCCTAT CCGATGCTCC CAGTAACCAG GGGGACCACC CGAGCTAATC ATGGAATGTC 4140 

TGTTCCCAGC AAACACGATA AAGAAAGATT GTGCACTTTA ACCTC7CTCA TCAGGGCCCA 4 200 

AGGGCTGGCT GGGATTTTTT TTTTTTTTTT TTCCCACTAA CTTTG7TTCT GACCAAAGTG 4 260 

AATTGGACGC ATTCTGCTAA AAGACATCCC CG7AGACATA GGGGAGAGAG TTGCTCGC7G 4 320 

AGGGCTTCCC TTGGCTTCCA GAAGGCAGCC TTCCATCCAG ACAAGCCAGT GAGCTCTCCC 4 380 

CTTGGGATCA CTGGGGTGAT CAGTCAGCAG ATTGATTC7C ATTCATAAGA TCATTCCTCC 44 40 

CTTTAAAT7G AGCCCCTAAG AGCAC7GGCC TGGGAGTCAG ACAGACCTGG GTTCAAGTCC 4 500 

TCAGTCCCCT GCCCACTCCC TACGTGACTT TGATCAGGTC ACTAGTGTCT CTCTGAGCCT 4 560 

CAGTTTCCCC 7CTGTAACTT GGGGTTGAAC TAAAACACCT GTCCTGCCTA CCTCACAAGG 4 620 

TCACTCTGAG GATTGAAACT TGATCTTGTC CAGGAAAGCT TTGTACCAAA CAGTGAAGCC 4 680 

GCCCTGATCC GTGAGGTATG AGTATGACTC TGACCTTCAG CCCTCCCTAC AGCCGGGGGT 4 74 0 

GTGGCCCAGA GAAGCTTCCA GCACAGCCCT CTACCCAGAA CATCCGGGCT GGAGGGAGGC 4 8Q0 

TCCCAGTGAC TTTTCTGACA TTCCTAGACA GGTTCA7TCT 77GCTCAAGA AAGGCCTGAA 49 60 

TGACAATGTC CAGGATGTC7 GCACAACTGA GCAGCTCGCT CAC7CCCTAA AGAAACCTAT 4 9 20 

TGGCAGC77C AACAGGCAGG CAATAA7C7C 7TCCCAGAAC CACTGCAGTC AGGAA7AAAC 4 980 

7G7TTTCTCC ACCAGGCT7T GACAAAAGGG CCCACAGGAA TCTTACCAA7 GCCAACATTT 504 0 

CAAAGCACCC TA7T7CACG7 AGCA7AGCTT TCTGCTCCCC T7CCCCAAAG AGAGGTTATG 5100 

GAGGTACTGT AGCTTTTAGG GAAAAAAAAA ATGTTAACAC ATCACAGGTC AAGTTGAAGT 5160 

CATTC7CTGT T7AGGCACTA AAAATCGG7G TTGTCACTCA C7GTGTA7TA CCAGTATTTA 5220 
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CT7GCTTTCT TCATTTTACC AAAACCAAAT TTAATTTAAA GGACCACATT AATTTTTCAA 5230 

AGCGAAAGAG ACAAT7AATT GTACATAATG TATACACACA CAAAAAAAAA AATACCTGTA 5 340 

GAAATATTAT TCCAGCATAG CAGGAAAACA AACAAAAGTA TTGGACTCTC GGAGGTGAGC 54 00 

CTGTGCGTCT GTAACCCTT7. GTGACTCCTG ACCCTGCCC" GTCTTCTAGG TTAACTCACG 5-160 

AAGTACATTC TCTGTCTTAC TGATACTGTA GGTTCACCCA TTTTTTTTTA ATTTCCTCGC 5520 

AAATAACAAG ACCCACAGAA GTGACTCTAG CTACTTAATG CTTCTGTTCT TTTATATGCA 5580 

GCAAACACAC CGTCCATTTC 7GAAGAGGC7 TCGGCCTGAA GGCATTTTCC AATGATGTTA 564 0 

GTGCACAAAA CGCTTTAAAT TAGACTGGAA CTGCCAGAAT CAAATGTAAA TGAGGAATTT 57 00 

CTCGTACCCC 7ACTGCA7GG TATCGATTTT TAATAAAT7G 77GCAAATT7 GTTT7TATGA 57 60 

ATAAAAGGAA AAAACC7G7C GTCT77 573 6 
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(2) INFORMATION FOR SEQ "ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5459 base pairs 

(B) TYPE: nucleic acid : ; . 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 



GGCCGTTCTG AGCGAGCAGA AGCCGGAGCC GAGCCGGGGC CCGTGCCGGC GCCATTGCGC 60 

GGGRGCCGCG GGCAAAKCTC CGCGCCGGGC GGCGGGCCGG GCCAGGCCAT GCGGGCCGAG 120 

rSAGCTCCCG CCCGCAGCCC GCGGCGCGGC ATGGCTTCCC CGCCGAGCTC CGGGCAGCCC 180 

CCCCCCCCGG CGCGGGCGCC GGGCGCCCCG CGCCTGCTOC TGCCCCTCCT QCTQTCCCTG 2 40 

craeTOTaoT Tascocccaa GoceTaosac tgoacgcgcg ccgccccccg gccgccgccc 300 

AGCAGCCCGC CGCTCTCCAT CATGGGCCTC ATGCCGCTCA CCAAGGAGGT GGCCAAGGGC 3 60 

AGCATCGGGC GCGGCGTGCT CCCCGCCGTG GAGCTAGCCA TCGAGCAGAT CCGCAACGAG 4 20 

TCACTCCTGC GCCCCTACTT CCTGGACCTG CGACTCTATG ACACCGAGTG TGACAATGCA 4 80 

AAGCGACTGA AAGCCTTCTA TGACGCAATA AAGTATGGGC CGAACCATTT GATGGTGTTT 540 

GGAGGCGTCT GTCCGTCTGT CACATCTATT ATCGCGGAGT CCCTCCAAGG CTGGAATCTG 600 

GTGCAGCTTT CCTTCGCCGC CACCACGCCT GTTCTTGCGG ATAAGAAGAA GTACCCGTAT 660 

TTCTTCCGGA CGGTGCCGTC AGACAACGCG GTGAACCCCG CCATCCTGAA GCTCCTGAAG 7 20 

CACTTCCGCT GGCGGCGTGT GGGCACACTC ACGCAGGACG YGCAGCGCTT CTCCGAGGTG 7 90 

A GGAATGACC TGACTGGGGT TCTGTATGGG GAAGATATTG AGATCTCAGA CACAGAGAGT 8 40 

TTCTCCAATG ATCCCTGCAC CAGCGTCAAA AAGCTCAAGG GGAATGACGT GCGGATCATC 900 

CTTGGCCAGT TTGACCAGAA TATGGCAGCA AAAGTCTTCT GTTGTGCCTT CGAGGAGAGC 960 

ATGTTTGGCA GCAAGTACCA GTGGATCATC CCGGGATGGT ACGAGCCTGC GTGGTGGGAG 1320 

CAGGTGCATG TGGAGGCCAA TTCCTCACGC TGCCTGCGCA GAAGCCTCCT GGCTGCCATG 1080 

GAAGGTTACA TCGGAGTGGA CTTTGAGCCC CTGAGCTCCA AACAAATCAA GACCATCTCA 1140 

GGGAAGACTC CACAGCAGTT TGAAAGAGAG TACAACAGCA AACGTTCAGG CGTGGGGCCC 12 00 

AGCAAGTTCC ATGGGTACGC CTACGATGGG ATCTGGGTCA TCGCCAAGAC CCTACAGAGG 12 60 

GCCATGGAGA CACTGCATGC CAGTAGCAGG CACCAGCGGA TCCAGGACTT CAACTACACA 1320 

GACCACACGC TGGGCAAAAT CATCCTCAAT GCCATGAACG AGACCAACTT CTTCGGGGTC 1380 

ACGGGTCAAG TTGTGTTCCG GAACGGGGAG AGAATGGGAA CCATTAAATT TACTCAATTT 14 40 

CAAGACAGCA GAGAGGTGAA GGTCGGCGAA TACAACGCGG TGGCTGACAC ACTGGAGATC 1500 

ATCAATGACA CCATAAGGTT CCAGGGGTCC GAGCCACCCA AGGACAAGAC CATCATTCTG 15 60 

GAGCAGCTTC GGAAGATCTC GCTTCCACTG TATAGCATCC TGTCCGCTCT CACCATCCTC 1520 

GGCATGATCA TGGCCAGCGC CTTCCTCTTC TTCAACA7CA AGAACCGGAA CCAAAAGCTG 1680 

ATTAAGATGT CAAGCCCCTA CATGAACAAC CTCATCATCC TGGGAGGAAT GCTGTCCTAT 17 40 

GCA7CCATCT TCCTCTTTGG CCTCGACGGG TCCTTCGTCT CAGAAAAGAC CTTTGAAACA 1800 

CTCTGCACGG TCCGGACCTG GATTCTCACC GTGGGCTACA CAACTGCCTT TGGGGCCATG I8 60 

TTTGCAAAGA CCTGGAGGGT CCATGCCATC TTCAAAAATG TGAAGATGAA GAAGAAGATC 19 20 

ATCAAAGACC AGAAGCTGCT TGTGATTGTG GGGGGCATGC TGCTCATCGA CCTGTGCATC 19 30 

CTGATCTGTT GGCAGGCTGT GGACCCCCTG CGGAGGACAG TGGAGAGGTA CAGCATGGAG 20 40 

CCGGACCCAG CAGGCCGGGA CATCTCCATC CGCCCATTGC TGGAACACTG CGAAAACACC 2100 

CACATGACCA TCTGGCTTGG CATTGTCTAC GCCTACAAGG GGCTCCTCAT GCTATTCGGT 2150 

TGTTTCTTGG CATGGGAAAC CCGCAATGTG AGCATCCCTG CCCTCAACGA CAGCAAGTAC 2220 

ATCGGCATGA GTGTGTACAA TGTGGGGATC ATGTGCATCA TCGGGGCTGC TGTCTCCTTC 22 80 

CTGACGCGTG ACCAGCCCAA CGTGCAGTTC TGCATCGTGG CCCTGGTCAT CATCTTCTGC 2 34 0 

RGCACCATCA CTCTCTGCCT GGTGTTTGTG CCAAAGCTCA TCACTCTGAG GACAAACCCT 2100 

GACGCAGCCA CTCAGAACAG GCGGTTCCAG TTCACACAGA ACCAGAAGAA AGAAGATTCG 2 4 60 
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AAGACCTCCA CTTCAG7CAC CAGCGTG AAC CAGGCGAGCA CCTCACGCCT CGAGGGACTG 
CAGTCAGAAA ACCACCGCCT TCGAATGAAG ATCACAGAGC 7CGACAAAGA CT7GCAAGAA 
GTCACCATGC AGGTACAACA CACACCAGAG AAGACCACAT ACATCAAACA GAATCACTAC 
CAAGAGCTCA ACGACATCCT CAGCTTGGGC AACTTCACAG AGAGCACAGA 7CGAGGAAAC 
. GCCATTCTAA AAAATCACCT CGATCAAAAC CCCCAGCTCC AGTGGAACAC GACACAGCCC 
TCAAGAACAT GCAAAGACCC CATAGAAGAC ATCAACTCCC CGGAGCACAT CCAGCGCCGG 
CTGTCGCTCC AGCTCCCGAT CCTTCACCAC GCCTACCTCC CATCCA7CGG AGGCG7GGAT 
GCCAGCTGCG TCAGCCCCTG TGTCAGCCCT ACCGCCAGCC CTCGCCACAG ACACGTACCA 
CCCTCCTTCC GAGTCATGGT CTCGGGCCTG TAGGGGTGGG AGGCCTGGGG CCGGGGCCTC 
CCCGGGACAG CACCATGCTG GGCCAAGGCG CCTGCCACAG GCACACTGAC GGCGGCGAGA 
AGCTGGGCAC CATGCTGCCT CTCCAGACTG CTGGAATGGC GCTCAGGCAG AGCGGGACTC 
GGCACCGACC TCGAGCCTTA TCTGTGAAGG TCTTACTCTC ACAGAGGAGA GGAATGACAA 
TGACTTCTCC TTCTTGGCGT CTGCAAACAA AGAGGAGTTG GGATGTCTGA AACTTGCAAW 
AAC AAATC A A ACTCTAGACA AAGGAGAGAG GCCTCGGACT CCTGCTGTCC TCGCCAAGTG 
GCCAGAGCAA GGGCTCTGCA GAAATGGCCC TCGGCACTCT GCACACACGT CATCGGTGGT 
GCCCGCCCCC ACACACCACC CCTTCTCCAC YCTGGGGGAG AAGGTGAGCA GGGCCACCAG 
GTCCCCAGCA GCTGGCCCAG TCCACTTCAG ACTTCCTGGA ACAAGGGAGT CAGCAGGAAG 
GGTAGCCGGG CTCTGCCCTG GGGCTGGGGG AATCAGAGCA AGCAGCTGCG CATGCTCCCA 
GGGGACATCC ATCAGAAGAG AGATGTTTGT TACTCTCTGA TAAATGCCAC ACATTAACAC 
AATAACACCC GTTCCGGGGA CTGTGGGGAT TCAGGGCACC GCACTGCAGA CATGCTCTGC 
AGGATGCAAC AGACAG7CTG 7CATGCACC7 ACCG7G7CGG CCA7G77CTT G7GT7CAAA7 
77AGGGT7CC CGG7AATAAG GGGAACCA7C 7GAGCTAATC ATGGAA7G7C 7GCTCCCAGA 
AAACACGATA AAGAAAGAT7 CTGCACTTTA ACC7C7CCCA TGGGGGCCCA AGGGC7GCTG 
GGATTCCCCC 7C7CCCAACA CAC7GACTTT G7T7TTGACC AAAG7GAA7C AGGGGCAT7C 
TGCTAAAAG7 CA7C7CTG7G 77CCTGAGC7 AGAGC7GC7G GCCCAGGGCC CCC7CGCT7C 
CAGAAGGTAG CCC7CCAG7G CCCCTCCAGC CCTAACCCGT GAG7GCCCAC C7AGGG7G7C 
ACC7GAGGCG G7CARTCAGC AGGCTGCT7C CCA7CCA7CA CGACA7ACTT 7CC77GTAAT 
TGAACCCCAA AGAGCAC7GG CCTGGGAG7C ACACCTGGA7 7TGAG7CC7C AG7CCCTTCA 
GACTC7C7CC G7GACC77GG GCAAG7CACC AGTG7C7CTC 7GACCC7CGG 777CCCCTCT 
G7AACAGAC7 GGGGTTGAAA TAACACCCAT CCTGCC7ACC 7CACAGGGTC AC7C7GAGCA 
7GGACAC7TG ATCTCA77CA 7CCAGGAGAG CTTTGCACTG GGAAGCAGC7 C7GACC7G7G 
AGG7A77AG7 ACGACCA7GA CCT7CAGCCC 7CCCACCAGC TGGGGAGCGT GCGG7GCGGC 
CCAGAGAAAC 7TCCTGCAAG GCCC7C7ACC TAGGGACAGG AACAGCCCAG 7CAGAGGCAG 
GC7CAGGACT C77C7GACA7 7CC77AATAG G7TCA7TCTT TGC7AAAGGC C7GAA7GGCA 
CCP.7CCAGGG GCGTC7GCAC AAACGAGTGG 7TTGCTCAC7 CCC7AAAGAA 7CC7AAAGGC 
AGCT7CAACA GGCAGGCAA7 AATCTCTTCC CAGACCAC7G CAG7CAGGAA CACAC7GC7G 
TCAACCACCA GGCTGTGACG AAAGGGCCCA CGGGAA7CAC CA7CGCCAAC ATTTCAAAAC 
ACCC7AG77C ACG7AGCA7A GC7CT7CC7C CCCA7CCC7C AAAGAGAGGT 7ACAGAGG7T 
C7GTAGC7TT 7AAGGGGGGA AAAATG7CAA CACA7CACAG GTCAAG77GA 7GTCC7TC7C 
7G77TAGGCA CCAAAACTCC GTG77GTCAC 7CAC7G7G7G 7GTTAACCAG 7A7TTCCTTG 
C7T77TCGAC TTCACCAAAA CCAAAT7TCA 777AAAAGAC CGYATTAGTT 7TTTTCAAAG 
GGAAAGAGAT AAT7AACTGT ACATAATGTA 7ACAGACAAA ACAAAACC7G 7AGAAATATT 
ACTCCAGCA7 AGCAGGAAAA AAAAAAAAAC AAAAAACAAA AAACAAAAG7 A77GGA7TG7 
CGGAGGTGAG CG7GCG7C7G 7AATCCCTTG 7GACTCCGAA C7G7GC7CCG 7C77C7AGGT 
7AACCCACAA GA7ACATCTC 7G7CTACTGA 7CC7GTAGG7 7CACCCA7T7 T7TT77AA77 
7C7CTGCAA7 7AACAAGCCC CACAAAGTGA 7TC7GGC7A7 C7AC7GG77C 7GT7C77T7A 
TA7GCAGCAA ACACACGG7C CATT7CCTAG AGGCTTCCGC CCAAGGGCA7 CTTCCAGTGA 
CGC7AG7GCG CACACACACA CACACACACA CACACACACA CACACACACA CACACACGAC 
TGCTTTCAAT TAGACCGGAA C7GGCAGAA7 CAAA7G7AAA 7GAGGAG777 C7CTCGCCCC 
AC7GCA7GG7 A7CGATTTT7 AA7AAA77G7 7GCAAA777G T77TTA7GAA 7AACAGGGG 
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(2) INFORMATION FOR SEQ ID NO:3: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 940 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Met Ala Ser Pro Arg Ser Ser Gly Gin Pro Gly Pro Pro Pro Pro Pro 

1 5 10 lS 

Pro Pro Pro Pro Ala Arg Leu Leu Leu Leu Leu Leu Leu Pro Leu Leu 

20 25 30 

Leu Pro Leu Ala Pro Gly Ala Trp Gly Trp Ala Arg Gly Ala Pro Arg 

35 40 45 

Pro Pro Pro Sec Ser Pro Pro Leu Ser lie Met Gly Leu Met Pro Leu 

50 55 60 

Thr Lys Glu Val Ala Lys Gly Ser lie Gly Arg Gly Val Leu Pro Ala 
55 70 75 80 

Val Glu Leu Ala He Glu Gin He Arg Asn Glu Ser Leu Leu Arg Pro 

35 90 95 

Tyr Phe Leu Asp Arg Leu Tyr Asp Thr Glu Cys Asp Asn Ala Lys Gly 

100 105 U0 

Leu Lys Ala Phe Tyr Asp Ala lie Lys Tyr Gly Pro Asn His Leu Met 

115 120 125 

Val Phe Gly Gly Val Cys Pro Ser Val Thr Ser Me lie Ala Glu Ser 

130 135 140 

Leu Gin Gly Trp Asn Leu Val Gin Leu Ser Phe Ala Ala Thr Thr Pro 
145 150 155 160 

Val Leu Ala Asp Lys Lys Lys Tyr Pro Tyr Phe Phe Arg Thr Val Pro 

165 170 175 

Ser Asp Asn Ala Val Asn Pro Ala He Leu Lys Leu Leu Lys His Tyr 

180 195 190 

Gin Trp Lys Arg Val Gly Thr Leu Thr Gin Asp Val Gin Arg Phe Ser 

195 200 205 

Glu Val Arg Asn Asp Leu Thr Gly Val Leu Tyr Gly Glu Asp He Glu 

210 215 220 

lie Ser Asp Thr Glu Ser Phe Ser Asn Asp Pro Cys Thr Ser Val Lys 
225 230 235 240 

Lys Leu Lys Gly Asn Asp Val Arg He He Leu Gly Gin Phe Asp Gin 

245 250 255 

Asn Met Ala Ala Lys Val Phe Cys Cys Ala Tyr Glu Glu Asn Met Tyr 

260 265 270 

Gly Ser Lys Tyr Gin Trp He He Pro Gly Trp Tyr Glu Pro Ser Trp 

260 235 
Tr? Glu Gin Val His Thr Glu Ala Asn Ser Ser Arg Cys Leu Arg Lys 

3?C 295 300 

Asr. Leu Leu Ala Ala Met Glu Gly Tyr He Gly Val Asp Phe Glu Pro 
30= 310 315 320 
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Leu Sec Sec Lys Gin lie Lys Thr lie Ser Gly Lys The Fro Glr. Gin 

325 330 335 

Tyc Glu Arg Glu Tyr Asn Asn Lys Acq Sec Gly Val Gly Pco Sec Lys 

340 315 350 

Phe His Gly Tyc Ala Tyr Asp Gly lie Tcp Val He Ala Lys The Leu 

355 360 365 

Gin Arg Ala Met Glu Thr Leu His Ala Ser Sec Arg His Gin Acg lie 

370 375 380 

Gin Asp Phe Asn Tyr Thr Asp His Thr Leu Gly Arg lie He Leu Asn 
385 390 395 <jqo 

Ala Met Asn Glu Thr Asn Phe Phe Gly Val Thr Gly Gin Val v a *l Phe 

105 410 415 

Arg Asn Gly Glu Arg Met Gly Thr He Lys Phe Thr Gin Phe Gin Asp 

420 425 430 

Ser Arg Glu Val Lys Val Gly Glu Tyr Asn Ala Val Ala Asp Thr Leu 

435 440 445 

Glu He He Asn Asp Thr He Arg Phe Gin Gly Ser Glu Pro Pro Lys 

450 455 460 

Asp Lys Thr He He Leu Glu Gin Leu Arg Lys He Ser Leu Pro Leu 
465 470 475 480 

Tyr Ser lie Leu Ser Ala Leu Thr He Leu Gly Met He Met Ala Ser 

485 490 495 

Ala Phe Leu Phe Phe Asn He Lys Asn Arg Asn Gin Lys Leu lie Lys 

500 505 510 

Met Ser Ser Pco Tyr Met Asn Asn Leu He He Leu Gly Gly Met Leu 

515 520 525 

Ser Tyr Ala Ser He Phe Leu Phe Gly Leu Asp Gly Sec Phe Val Ser 

530 535 540 

Glu Lys Thr Phe Glu Thr Leu Cys Thr Val Arg Thr Trp He Leu thr 
545 550 555 560 

val Gly Tyr Thr Thr Ala Phe Gly Ala Met Phe Ala Lys Thr Trp Arg 

565 570 575 

val His Ala lie Phe Lys Asn Val Lys Met Lys Lys Lys He He Lys 

580 585 590 

Asp Gin Lys Leu Leu Val He Val Gly Gly Met Leu Leu He Asp Leu 

595 600 605 

Cys He Leu He Cys Trp Gin Ala Val Asp Pro Leu Arg Arg Thr Val 

610 615 620 

Glu Lys Tyr Ser Met Glu Pro Asp Pro Ala Gly Arg Asp He Ser He 
625 630 635 640 

Arg Pro Leu Leu Glu His Cys Glu Asn Thr His Met Thr He Trp Leu 

645 650 655 

Gly lie Val Tyr Ala Tyr Lys Gly Leu Leu Met Leu Phe Gly Cys Phe 

660 665 670 

Leu Ala Trp Glu Thr Arg Asn Val Ser He Pro Ala Leu Asn Asp Ser 

675 580 685 

Lys Tyr He Gly Met Ser Val Tyr Asn Val Gly He Met Cys He lie 

690 695 700 

Gly Ala Ala Val Ser Phe Leu Thr Arg Asp Gin Pro Asn Val Gin Phe 
7 °5 710 715 720 

Cys He Val Ala Leu Val lie He Phe Cys Ser Thr He Thr Leu Cys 

725 • 730 735 

Leu Val Phe Val Pro Lys Leu He Thr Leu Arg Thr Asn Pro Asp Ala 
? 40 745 750 
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Aia Thr Gin Asn Aro Arg Fhe Gin Phe The Gin Asn Gin Lvs Lvs Gl 11 

' ^60 765 

Asp Ser Lys Thr Ser Thr Ser val Thr Sec Vai Asn Gin Ala Se: Thr 

770 7~5 780 

Ser Arg Leu Glu Gly Leu Gin Ser Glu Asn His Arg Leu Arg Met Lys 
735 790 795 800 

He Thr Glu Leu Asp Lys Asp Leu Glu Glu Val Thr Met Gin Leu Gin 

80S 9io 815 

Asp Thr Pro Glu Lys Thr Thr Tyr He Lys Gin Asn His Tyr Gin Glu 

820 825 830 

Leu Asn Asp He Leu Asn Leu Gly Asn Phe Thr Glu Ser Thr Asp Gly 

835 840 845 

Gly Lys Ala rie Leu Lys Asn His Leu Asp Gin Asn Pro Gin Leu Gin 

850 855 860 

Trp Asn Thr Thr Glu Pro Ser Arg Thr Cys Lys Asp Pro He Glu Asp 
865 870 875 88Q 

He Asn Ser Pro Glu His He Gin Arg Arg Leu Ser Leu Gin Leu Pro 

885 ago 895 

He Leu His His Ala Tyr Leu Pro Ser lie Gly Gly Val Asp Ala Ser 

900 905 910 

Cys Val Ser Pro Cys Val ser Pro Thr Ala Ser Pro Arg His Arg His 

915 920 925 

Val Pro Pro Ser Phe Arg Val Met Val Ser Gly Leu 
930 93= 940 
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(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 940 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Ala Ser Pro Pro Ser Ser Gly Gin Pro Arg Pro Pro Pro Pro Pro 

15 10 15 

Pro Pro Pro Ala Arg Leu Leu Leu Pro Leu Leu Leu Ser Leu Leu Leu 

20 2S 30 

Trp Leu Ala Pro Gly Ala Trp Gly Trp Thr Arg Gly Ala Pro Arg Pro 

35 40 45 

Pro Pro Ser Ser Pro Pro Leu Ser lie Met Gly Leu Met Pro Leu Thr 

50 55 60 

Lys Glu Val Ala Lys Gly Ser He Gly Arg Gly Val Leu Pro Ala Val 

65 7 o 75 ao 

Glu Leu Ala He Glu Gin He Arg Asn Glu Ser Leu Leu Arg Pro Tyr 

85 go 95 

Phe Leu Asp Leu Arg Leu Tyr Asp Thr Glu Cys Asp Asn Ala Lys Gly 

100 105 HO 

Leu Lys Ala Phe Tyr Asp Ala lie Lys Tyr Gly Pro Asn His Leu Met 

115 120 125 

Val Phe Gly Gly Val Cys Pro Ser Val Thr Ser Tie He Ala Glu Ser 

130 135 140 

Leu Gin Gly Trp Asn Leu Val Gin Leu Ser Phe Ala Ala Thr Thr Pro 
H5 150 155 160 

Val Leu Ala Asp Lys Lys Lys Tyr Pro Tyr Phe Phe Arg Thr Val Pro 

165 170 175 

Ser Asp Asn Ala Val Asn Pro Ala He Leu Lys Leu Leu Lys His Phe 

180 185 190 

Arg Trp Arg Arg Val Gly Thr Leu Thr Gin Asp Xaa Gin Arg Phe Ser 

195 200 205 

Glu val Arg Asn A3p Leu Thr Gly Val Leu Tyr Gly Glu Asp He Glu 

2i0 215 220 

He Ser Asp Thr Glu Ser Phe Ser Asn Asp Pro Cys Thr Ser Val Lys 
225 230 235 240 

Lys Leu Lys Gly Asn Asp v a l Arg He He Leu Gly Gin Phe Asp Gin 

245 250 255 

Asn Met Ala Ala Lys Val Phe Cys Cys Ala Phe Glu Glu Ser Met Phe 

260 265 270 

Gly Ser Lys Tyr Gin Trp He He Pro Gly Trp Tyr Glu Pro Ala Trp 

275 280 285 

Trp. Glu Gin vai His Val Glu Ala Asn Ser Ser Arg Cys Leu Arg Arg 

290 295 300 

Ser Leu Leu Ala Ala Met Glu Gly Tyr He Gly Val Asp Phe Glu Pro 
305 310 315 320 

Leu Ser Ser Lys Gin He Lys Thr He Ser Gly Lys Thr Pro Gin Gin 
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225 330 33S 

Phe Glu Arg Clu Tyr Asn Ser Lys Arg Ser Giy Vai Giy Pro Ser Lys 

340 345 1 SO 

Phe His Cly Tyr Ala Tyr Asp Giy lie Trp Vai lie Ala Lys Thr Leu 

355 . 360 365 

Gin Arg Ala Mec Glu Thr Leu His Ala Ser Ser Arg His Gin Arg lie 

370 375 360 

Gin Asp Phe Asn Tyr Thr Asp His Thr Leu Giy Lys lie lie Leu Asn 
385 390 395 40 0 

Ala Mec Asn Glu Thr Asn Phe Phe Giy Vai Thr Giy Gin Vai Vai Phe 

405 410 415 

Arg Asn Giy Glu Arg Met Giy Thr lie Lys Phe Thr Gin Phe Gin Asp 

420 425 430 

Ser Arg Glu Vai Lys Vai Giy Glu Tyr Asn Ala Vai Ala Asp Thr Leu 

435 440 445 

Glu lie He Asn Asp Thr He Arg Phe Gin Giy Ser Glu Pro Pro Lys 

450 455 460 

Asp Lys Thr He He Leu Glu Gin Leu Arg Lys He Ser Leu Pro Leu 
465 470 475 480 

Tyr Ser He Leu Ser Ala Leu Thr He Leu Giy Met He Met Ala Ser 

485 490 495 

Ala Phe Leu Phe Phe Asn He Lys Asn Arg Asn Gin Lys Leu lie Lys 

500 505 510 

Met Ser Ser Pro Tyr Met Asn Asn Leu He lie Leu Giy Giy Met Leu 

515 520 525 

Sec Tyr Ala Ser He Phe Leu Phe Giy Leu Asp Giy Ser Phe Vai Ser 

530 535 540 

Glu Lys Thr Phe Glu Thr Leu Cys Thr Vai Arg Thr Trp lie Leu Thr 
545 550 555 560 

Vai Giy Tyr Thr Thr Ala Phe Giy Ala Met Phe Ala Lys Thr Trp Arg 

565 570 575 

vai His Ala He Phe Lys Asn vai Lys Met Lys Lys Lys He He Lys 

580 585 590 

Asp Gin Lys Leu Leu Vai He Vai Giy Giy Met Leu Leu He Asp Leu 

595 600 605 

Cys lie Leu He Cys Trp Gin Ala Vai Asp Pro Leu Arg Arg Thr Vai 

610 61S 620 

Glu Arg Tyr Ser Met Glu Pro Asp Pro Ala Giy Arg Asp He Ser He 
€25 630 635 5 40 

Arg Pro Leu Leu Glu His Cys Glu Asn Thr His Met Thr He Trp Leu 

645 650 655 

Giy He Vai Tyr Ala Tyr Lys Giy Leu Leu Met Leu Phe Giy Cys Phe 

660 665 670 

Leu Ala Trp Glu Thr Arg Asn Vai Ser He Pro Ala Leu Asn Asp Ser 

675 660 685 

Lys Tyr He Giy Met Ser Vai Tyr Asn Vai Giy He Met Cys He He 

690 695 700 

Giy Ala Ala VaJ Ser Phe Leu Thr Arg Asp Gin Pro Asn Vai Gin Phe 
7 05 710 715 720 

Cys He vai Ala Leu Vai He He Phe Cys Ser Thr He Thr Leu Cys 

725 730 735 

Leu Vai Phe Vai Pro Lys Leu He Thr Leu Arg Thr Asn Pro Asp Ala 

740 745 750 

Ala Thr Gin Asn Arg Arg Phe Gin Phe Thr Gin Asn Gin Lys Lys Glu 
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Asp Set 
770 
Ser Arg 
785 

He Thr 

Asp Thr 

Leu Asn 

Gly Lys 
8S0 
Trp Asn 
365 

IXe Asn 



Cys Val 

Val Pro 
930 



Lys Thr 

Leu Glu 

Glu Leu 

Pro Glu 
820 
Asp He 
835 

Ala He 

Thr Thr 

Ser Pro 

His His 
900 
Ser Pro 
915 

Pro Ser 



Ser Thr 

GLy Leu 
790 
Asp Lys 
805 

Lys Thr 

Leu Ser 

Leu Lys 

Glu Pro 
870 
Glu His 
885 

Ala Tyr 
Cys val 
Phe Arg 



7 60 
Ser Val Thr 

775 

Gin Ser Glu 

Asp Leu Glu 

Thr Tyr He 
825 

Leu Gly Asn 

840 
Asn His Leu 
855 

Ser Arg Thr 
He Gin Arg 

Leu Pro Ser 

905 

Ser Pro Thr 

920 
val Met val 
935 



765 

Ser val Asn Gin 
780 

Asn His Arg Leu 
795 

Glu Val Thr Met 
810 

Lys^ Gin Asn His 

Phe Thr Glu ser 
845 

Asp Gin Asn Pro 
8 60 

Cys Lys Asp Pro 
875 

Arg Leu Ser Leu 
890 

He Gly Gly Val 

Ala Ser Pro Arg 
925 

Ser Gly Leu 
940 



Ala Ser The 

Arg Met Lys 
800 

Gin Leu Gin 

815 
Tyr Gin Glu 
830 

Thr Asp Gly 

Gin Leu Gin 

He Glu Asp 
880 

Gin Leu Pro 

8 95 
Asp Ala Ser 
910 

His Arg His 
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CLAIMS 

We claim: 

5 1. An isolated and essentially pure DNA molecule which encodes a 

mammalian gb2 GABA B receptor. 

2. The isolated and essentially pure DNA molecule of claim 1, wherein the 
DNA molecule is a cDNA molecule. 

10 

3. The isolated and essentially pure DNA molecule of claim 2 t wherein the 
mammalian gb2 GABA B receptor is a human gb2 GABA B receptor. 

4. The isolated and essentially pure DNA molecule of claim 2, wherein the 

15 cDNA molecule has the sequence given by SEQ. ID NO.: 1 or allelic variations thereof. 

5. The isolated and essentially pure DNA molecule of claim 2, wherein the 
mammalian gb2 GABA B receptor is a rat gb2 GABA B receptor. 

20 6. The isolated and essentially pure DNA molecule of claim 2, wherein the 

cDNA molecule has the sequence given by SEQ. ID NO.:2 or allelic variations thereof. 

7. An isolated mammalian GABA B receptor of subtype gb2. 

25 8. The isolated mammalian GABA B receptor of claim 7, wherein the 

mammalian gb2 GAB A B receptor is a human gb2 GAB A B receptor. 

9. The isolated mammalian GABA B receptor of claim 7, wherein the human 
gb2 GABA D receptor has the sequence given by SEQ. ID NO.:3 or allelic variations 
30 thereof. 
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10. The isolated mammalian GABA B receptor ofclaim 7. wherein the 
mammalian gb2 GABA„ receptor is a rat gb2 GABA„ receptor. 

11. The isolated mammalian GABA B receptor of claim 7, wherein the rat gb2 
GABA B receptor has the sequence given by SEQ. ID NO.:4 or allelic variations 
thereof. 



1 2. A recombinant DNA comprising a DNA molecule which encodes a 
mammalian gb2 GABA B receptor and which is operably linked to control nucleotide 
sequences capable of expressing the mammalian gb2 GABA„ receptor in a compatible 
host. 

13. The recombinant DNA of claim 12, wherein the mammalian gb2 GABAe 
receptor is a human mammalian gb2 GABA B receptor. 

14. An isolated or pure clone from a mammalian tissue DNA library, wherein 
the clone contains DNA which encodes a mammalian gb2 GABA B receptor. 

1 5. The isolated or pure clone of claim 14, wherein the clone is from a human 
tissue library and the DNA encodes a human gb2 GABA 9 receptor. 

16. A sequencing vector for sequencing DNA which encodes a mammalian 
gb2 GABA B receptor, said vector comprising a sequencing vector and DNA, which 
encodes a mammalian gb2 GABA B receptor, inserted into the vector in proper 
orientation and correct reading frame for sequencing the DNA. 

17. The sequencing vector of claim 16, wherein the DNA encodes a human 
gb2 GABA B receptor. 
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18. An expression vector for expressing DNA that codes for a mammalian gb2 
GABA B receptor in a compatible host, said expression vector comprising a vector 
capable of transforming a procaryotic or eucaryotic cell and DNA which encodes a 
mammalian gb2 GABA B receptor inserted into the vector in proper: orientation and 

5 correct reading frame for expression. 

19. The expression vector of claim 18. wherein the DNA encodes a human gb2 
GABA B receptor. 



10 



15 



25 



20. The expression vector of claim 19 wherein the cell is a mammalian cell. 

21. A process for producing a mammalian gb2 GABA B receptor, said method 
comprising expressing the expression vector of claim 18 in the cell transformed by the 



vector. 



22. A cell transformed with the expression vector of claim 18. 



23. A mammalian cell line comprising a mammalian cell stably transfected 
with the expression vector of claim 1 8 and capable of continuous growth in a suitable 

20 culture medium. 

24. The mammalian ceil line of claim 23 wherein the mammalian cell is 
selected from the group consisting of a neuronal cell, a fibroblast ceil, an epithelial- 
like cell, and a glial cell. 



25. The mammalian cell line of claim 23 wherein the cells of the cell line 
express a human gb2 GABA B receptor having the amino acid sequence of SEQ. ID 
NO.:3 and allelic variants thereof. 



26. A process for producing a human gb2 GAB A B receptor, said process 
comprising culturing the mammalian cell line of claim 25 in a suitable culture 
medium. 
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27. A method of making a mammalian cell that expresses a mammalian gb2 
GABA B receptor, said method comprising the steps of: 

transfecting a mammalian ceil that does not express the receptor with the 
expression vector of claim 1 82; and 

recovering said transfected mammalian cell. 

28. The method of claim 27, wherein the mammalian cell that does not express 
the receptor is taken from a cell line that is capable of continuous growth in a suitable 
culture medium and comprising the further step of cloning the transfected cell, thereby 
producing a cell line that is capable of continuous growth in a suitable culture medium, 
the cell line expressing mammalian gb2 GABA B receptor. 

29. The method of claim 28, wherein the mammalian gb2 GABA B receptor is a 
human gb2 GABA B receptor. 

30. The mammalian cell produced by the method of claim 29 

3 1 . The mammalian cell of claim 30, wherein the mammalian gb2 GABA B 
receptor is a human gb2 GABA B receptor. 

32. In a method for the pharmacological, physiological, functional, or other 
investigational analysis of a mammalian gb2 GABA n agonist or antagonist the 
improvement comprising the use of an effective amount of the mammalian ceil of 
claim 30 in the pharmacological, physiological, functional, or other investigational 
analysis. 



33. An in vitro method of determining the ability of a chemical to bind to a 
mammalian gb2 GABA B receptor, said method comprising the steps of: 

contacting the chemical with a cell or part thereof from the mammalian cell of 
claim 23; and 

determining the ability of said cell or part thereof to bind the chemical. 
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34. An in vitro method for evaluating a chemical to determine if it is a 
mammalian gb2 GABA G receptor agonist, said method comprising the steps of: 

contacting the chemical with a cell or part thereof from the mammalian cell 
line of claim 23; and 

measuring the effect of the chemical on a second messenger pathway of the cell 
or part thereof. 
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hgb2 MASPRSSGQPGPPPPPPPPPARLLLLLLLPLLL 

rgb2 masppssgqpr-pppppppparlllplllslll 

rgbl a mlllllvplflrplgaggaqtpnatsegcq i i hppweggi 
rgblb mgpggpctpvgwplplllvmaagvapvwa - 

o o o 

hgb2 

rgb2 - - 

rab1a RYRGLTRDQVKAI NFL PVDYE I EYVCRGEREVVGPKVRKCLANGSWTDMDTPS 

rgblb - 

hgb2 - - 

rgb2 

rgbl a rcvr i csksyltlengkvfltggdlpaldgarvefrcdpdfhlvgssrsvcsq 

rgblb - s- 

x 

hgb2 PLAPG-AWGWARGAPRPPPSSPPLS I MGLMPLTKEVAW3S I GRGVLPAVELAI 
rgb2 WLAPG-AWGWTRGAPRPPPSSPPLS IMGLMPLTKEVAKGS I GRGVLPAVELAI 

rgbl a gqwstp-kphcqvnrtphserravyigalfpms- ggwpg-gqacqpavemal 
rgbl b hs-phlprphprvpphpsserravyigalfpms- ggwpg-gqacqpavemal 

o o o o o o oooo o 

hgb2 eqi rn-esllrpyfldlrlydtecdnakglkafydai kygpnhlmvfggvcps 
rgb2 eqi rn-esllrpyflolrlydtecdnakglkafydaikygpnhlmvfggvcps 
rgbl edvnsrrdi lpdyelkli hhdskcdpgqatkylyellyndpik 1 1 lmpg-css 

o o o o o o oo o o o o o o 

hqb2 VTS 1 1 AESLQGWNLVQLSFAAnPVLADKKKYPYFFRTVPSDNAVNPAl LKLL 
rqb2 VTS 1 1 AESLQGVWLVQLSFMTTWLADKKKYPYFFRTVPSDNAVNPAI LKLL 
rgb 1 VSTLVAEAARMWNL I VL SYGSSS PALSNRQRFPTFFRTHPSATLHNPTRVKLF 

o oo ooo oo o o o oooo oo oo OO 

hqb2 KHYQWKRVGTLTQDVQRFSEVRNDLTGVLYGED I E I SDTESFSNDPCTSVKKL 

rqb2 khfrwrrvgtltqdcqrfsevrndltgvlygedi E I SDTESFSNDPCTSVKKL 

rgb 1 EKWGWKK I ATI QQTTEVFTSTLDDLEERVKEAGI E I TFRQSFFSDPAVPVKNL 

o o o o oo ooo oo oo oo o 

hab2 KGNDVRI I LGQFDQNMAAKVFCCAYEENMYGSKYQWI I PGWYELSWWEQVHTE 
rab2 KGNDVR 1 1 LGQFDQNMAAKVFCCAFEESMFGSKYQWI I PGWYEPAWWEQVH VE 
r |b1 KRQDARI I VGLFYETEARKVFCEVYKERLFGKKYVWFL I GWYADNWFK — -TY 

^ o o ooo o o o oooo o o oo o ooo o 

* 

hgb2 ANSSRCLRKNLLAAMEGY I GVDFEPLSSKQ I KT I SGKTPQQY-EREYNN-KRS 
rgb2 ANSSRCLRRSLLAAMEGY I GVDFEPLSSKQ I KT I SGKTPQQF-EREYNS-KRS 
rgbl DPS I NCTVEEMTEAVEGHITTE I VMLNPANTRS I SNMTSQEFVEKLTKRLKRH 

F 

hqb2 GVGPSKFH-G-YAYDG I WV I AKTLQRAMETLHASSRHQR I QDFNYTDHTLGR I 

rqb2 gvgpskfh-g-yaydg i wv i aktlqrametlhassrhqr i qdfnytdhtlgk i 

rgbl PEETGGFQEAPLAYDA I WALALALNKTSGGGGRSGV--RLEDFNYNNQTI TDQ 

o ooo oo o o o o oooo o 
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hgt>2 I LNAMNETN FFGVTGQVVFR - NGERMGT I KFTQFQDSREVKVGEYNAVADTLE 
rgb2 I LNAMNETNFFGVTGQVVFR- NGERMGT I KFTQFQDSREVKVGEYNAVADTLE 
rgbl | YRAMNSSSFEGVSGHVVFDASGSRMAWTL I EQLQGGSYKK I GYYDSTKDDL- 

o ooo o oo o ooo o oo o o o o o o o 

TM1 

hqb2 I INDTIRFQGSEPPKDKTI I LEQLRK I SLPLYS I LSALT I LGM I MASAFLFFN 
rgb2 1 1 NDTI RFQGSEPPKDKTI I LEQLRK I SLPLYS I LSALT I LGM I MASAFLFFN 
rgbl SWSKTDKWI GGSPPADQTLVIKTFRFLSQKLFI SVSVLSSLGIVLAWCLSFN 

o o oo o o ooo o o oo o o oo 

TM2 TM3 

hgb2 I KNRNQKL I KMSSPYMNNL 1 1 LGGMLSYAS I F LFGLDGSFVSEKTFETLCTVR 
rgb2 | KNRNQKL I KMSSPYMNNL 1 1 LGGMLSYAS IF LFGLDGSFVSEKTFETLCTVR 
rgbl | YNSHVRY I QNSQPNLNNLTAVGCSLALAAVFPLGLDGYH I GRSQFPFVCQAR 

o o ooo ooo oooo oooo o o o 

TM4 



hgb2 TWI LTVGYTTAFGAMFAKTWRVHA I F --KNVKM-KKK I IKDQKLLVlVGGMLL 

rgb2 twi ltvgyttafgamfaktwrvha if--knvkm-kkki i kdqkllvivggmll 
rgbl lwllglgfslgygsmftk iwwvhtvftkkeekkewrktlepwklyatvgllvg 

ooo o oo o a oo ooo a oo oo 

TM5 

hgb2 I dlci licwqavdplrrtvekysmepdpagrdisirpllehcenthmtiwlgi 
rgb2 | dlci licwqavdplrrtverysmepdpagrdisirpllehcenthmtiwlgi 
rgbl mdvltla i wq i vdplhrti etfakeepkedi dvs i lpqlehcsskkmntwlgi 

O 0 oo oooo oo o o o oo o oooo o oooo 

TM6 " 



hab2 vyaykgllmlfgcflawetrnvsipalndskyigmsvynvgimci igmvsfl 
rgb2 vyaykgllmlfgcflawetrnvs i palndsky i gmsvynvg i mc i igaavsfl 
rgbl FYGYKGLLLLLG I flayetksvstek i ndhravgmai ynvavlcl ITAPVTM I 

o ooooo o o ooo oo oo oo oo ooo oooo 

TM7 

hgb2 trdqpnvqfc ivalv i i fcstitlclvfvpkli tl-rtnpdaatqn-rrf-q 
rgb2 trdqpnvqfc ivalv i i fcstitlclvfvpkli tl-rtnpdaatqn-rrf-q 
rgbl lssqqdaafafasla i vfssy i tlwl fvpkmrrl i trgewqsetqdtmktgs 

o o oooo ooo oooo O O 00 

hgb2 ftqnqkkedsk-tstsvtsvnqastsrleglqsenhrlrmk I teldkdleevt 
rqb2 ftqnqkkedsk-tstsvtsvnqastsrleglqsenhrlrmk I teldkdleevt 
rgbl stnnneeeksrllekenreleki iaekeerv-selrhqlqsrqqlrsrrhppt 



oooo OOO 



hgb2 MQLQDTPEKTTY I KQNHYQELNDI LNLGNFTESTDGGKA I LKNHLDQNPQLQW 

rgb2 mqlqdtpektty i kqnhyqelndi lslgnftestdggka i lknhldqnpqlqw 
rgbl ppdpsgglprgpseppdrlscdgsrvhllyk 

hgb2(a) nttepsrtckdpi edi nspeh iqrrlslqlp i lhhaylps iggvdascvs ^S' ^ 

hgb2 b) nttepsrtckdpi edi nspeh iqrrlslqlp i lhh - 

hgb2(c] nttepsrtckdpiedinspehiqrrlslqlpilhhaylpsiggvdascvs 
rgb2 nttepsrtckdpi edi nspeh iqrrlslqlp i lhhaylps iggvdascvs 
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hgb2 a) pcvsptasprhrhvppsfrvmvsgl 

hgb2b] rhvppsfrvmvsgl 

hgb2(C) PCVSPTASPRHRTTLGRGVCCRNTVGSGCGEAGHHGWPLRTTRMALRWTG 

rgb2 pcvsptasprhrhvppsfrvmvsgl 

hgb2(c) rgrgrlgt 



Fig. 2C 



SUBSTITUTE SHEET (RULE 26) 



WO 99/61606 



PCT/US99/11869 



5/11 

no 4^ -si co _ 
• • • ■ 

en cn cr 

I III 




heart 

brain 

placenta 

lung 

liver 

skeletal muscle 

kidney 

pancreas 

amygdala 
caudate nucleus 
corpus callusum 
hippocampus 
total brain 
substantia nigra 

subthalamic nucleus 
thalamus 

cerebellum 
cerebral cortex 
medulla 
spinal cord 
occipital pole 
frontal lobe 
temporal lobe 
putamen 



SUBSTITUTE SHEET (RULE 26) 




SUBSTITUTE SHEET (RULE 26) 



WO 99/61 6<M> 



7/11 



PCT/US99/11869 



FIGURES 




SUBSTITUTE SHEET (RULE 26) 



WO 99/61 606 PCT/US99/VI 869 

8/11 



FIGURE 6 
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FIGURE 7 
HEK-293 rgbla stable cell lines 
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HEK-293 hgb2 stable cell lines 
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FIGURE 8 
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SEQUENCE LISTING 

<110> United States of America, represented by Sec. HHS 
Clark, Janet 
Bonner, Tom I . 

<120> Mammalian gb2 GABAB Receptors 



<130> 65879 

<150> SO/087,274 
<151> 1998-05-29 

<160> 30 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 5786 
<212> DNA 

<213> Homo sapiens 
<40O> l 

gccgcccctg agcagcctcg ccttcgcctc ccgcgtttcc tgccgtccgc cctcccccgg 6 0 

ccgagcccca ggggctgccg cctagcagct cccggcggga gagcggttca gagcgcgcac 12 0 

gggggcgggc ggaggcggcc cggtgcgggg cggccgcgct ggagagaggc gcgcgcggag 18 0 

acgccggccc ccctctccgc gtttgctcgc ttgctccccg cctcccgcac tccgctcgct 24 0 

cccacccctt cccggcgtga ttgatccgtc acgggcgccg ccgctgccgc cgccgccgcg 300 

gccgttctga gccgagccgg aaccctagcc cgagacggag ccggggcccg ggccggcgcc 360 

attgcgcggg cgccgcggga agaccttggc gcggggcggc gggccgggcc aggccatgcg 420 

ggccgagtga gccggcgccc gcagcccgcg gcgcggcatg gcttccccgc ggagctccgg 480 

gcagcccggg ccgccgccgc cgccgccacc gccgcccgcg cgcctgctac tgctactgct 54 0 

gctgccgctg ctgctgcctc tggcgcccgg ggcctggggc tgggcgcggg gcgccccccg 60 0 

gccgccgccc agcagcccgc cgctctccat catgggcctc atgccgctca ceaaggaggt 66 0 

ggccaagggc agcatcgggc gcggtgtgct ccccgccgtg gaactggcca tcgageagat 72 0 

ccgcaacgag tcactcctgc gcccctactt cctcgacctg cggctctatg acacggagtg 780 

cgacaacgca aaagggttga aagccttcta cgatgcaata aaatacgggc cgaaccactt 84 0 

gatggtgttt ggaggcgtct gtccatccgt cacatccatc attgcagagt ccctccaagg 90 0 

ctggaatctg gtgcagcttt cttttgctgc aaccacgcct gttctagccg ataagaaaaa 96 0 

atacccttat ttctttcgga ccgtcccatc agacaatgcg gtgaatccag ccattctgaa 1020 

gttgctcaag cactaccagt ggaagcgcgt gggcacgctg acgcaagacg tucagaggtt 108 0 

ctctgaggtg cggaatgacc tgactggagt tctgtatggc gaggacattg agatttcaga 114 0 

caccgagagc ttctccaacg atccctgtac cagtgtcaaa aagctgaagg ggaatgatgt 12 0 0 

gcggatcatc cttggccagt ttgaccagaa tatggcagca aaagtgttct gttgtgcata 1260 

cgaggagaac atgtatggta gtaaatatca gtggatcatt ccgggctggt acgagccttc 13 2 0 

ttggtgggag caggtgcaca cggaagccaa ctcatcccgc cgcctccgga agaatctgct 13 80 

tgctgccatg gagggctaca ttggcgtgga tttcgagccc ctgagctcca agcagatcaa 1440 

gaccatctca ggaaagactc cacagcagta tgagagagag tacaacaaca agcggtcagg 150 0 

cgtggggccc agcaagttcc acgggtacgc ctacgatggc atctgggtca tcgccaagac 1560 

actgcagagg gccatggaga cactgcatgc cagcagccgg caccagcgga tccaggactt 1620 

caactacacg gaccacacgc tgggcaggat catcctcaat gccatgaacg agaccaactt 1680 

cttcggggtc acgggteaag ttgtattccg gaatggggag agaatgggga ccattaaatt 174 0 

tactcaattt caagacagca gggaggtgaa ggtgggagag tacaacgctg tggccgacac 1800 

actggagatc atcaatgaca ccatcaggtt ccaaggatcc gaaccaccaa aagacaagac 1860 

catcatcctg gagcagctgc ggaagatctc cctacctctc tacagcatcc tctctgccct 1920 

caccatcctc gggatgatca tggccagtgc ttttctcttc ttcaacatca agaaccggaa 1980 

tcagaagctc ataaagatgt cgagtccata catgaacaac cttatcatcc ttggagggat 2040 

gctctcctat gcttccatat ttctctttgg ccttgatgga tcctttgtct ctgaaaagac 2100 

ctttgaaaca ctttgcaccg tcaggacctg gat^ctcacc gtgggctaca cgaccgcttt 2160 

tggggccatg tttgcaaaga cctggagagt ccacgccatc ctcaaaaatg tgaaaatgaa 222C 

gaagaagatc atcaaggacc agaaactgct tgtgatcgtg gggggcatgc t.gctgatcga 22 80 

cctgtgcatc ctgatctgct ggcaggctgt ggaccccctg cgaaggacag cggagaagta 2340 

cagcatggag ccggacccag caggacggga tatctccatc cgccctctcc tggagcactg 2400 

tgagaacacc catatgacca tctggcttgg catcgtctat gcctacaagg gacttctcat 2460 

gttgttcggt tgtttcttag cttgggagac ccgcaacgtc agcatccccg cactcaacga 252G 

cagcaagtac atcgggatga gtgtctacaa cgnggggatc atgtgcatca tcggggccgc 2580 

tgtctccttc ctgacccggg accagcccaa tgtgcagttc tgcatcgtgg ctctggtcat 2640 

caccttctgc agcaccatca ccctctgcct ggtatccgcg ccgaagctca tcaccctgag 270C 

aacaaaccca gatgcagcaa cgcagaacag gcgattccag ttcactcaga atcagaagaa 2760 

agaagattct aaaacgtcca cctcggtcac cagtgtgaac caagccagca catcccgcct 2820 
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ggagggccta cagtcagaaa accatcgcct gcgaatgaag atcacagagc tggataaaga 2 880 

cttggaagag gtcaccatgc agctgcagga cacaccagaa aagaccaccL acatcaaaca 2 94 0 

gaaccactac caagagctca atgacatcct caacctggga aacttcactg agagcacaga 3000 

tggaggaaag gccattttaa aaaatcacct cgatcaaaat ccccagctac agtggaacac 3 060 

aacagagccc tctcgaacat gcaaagaccc tacagaagat ataaactctc cagaacacat 3120 

ccagcgtcgg ctgtcccccc agccccccac cctccaccac gcccacctcc cacccatcgg 3180 

aggcgtggac gccagctgtg tcagcccctg cgccagcccc accgccagcc cccgccacag 3240 

acacgtgcca cccccctccc gagtcatggt ctcgggcctg taagggtggg aggcctgggc 3 3 00 

ccggggcctc ccccgtgaca gaaccacact gggcagaggg gtctgctgca gaaacactgt 3360 

cggctctggc tgcggagaag ctgggcacca tggctggcct cccaggacca ctcggatggc 34 20 

actcaggtgg acaggacggg gcagggggag acccggcacc tgacctcgag ccttatttgt 3480 

gaagtcctta tttctccaca aagaagagga acggaaatgg gacgtcttcc ttaacatccg 3 540 

caaacaagga ggcgctggga Catcaaactt gcaaaaaaaa aaaaaaaaca aaacaaaaaa 3600 

aaaaactaga caaggagaga ggcactagaa ctccagctgg aagtcacgga gtggctcgag 3660 

cagccttggg aagaggcaag gagcttctga agaaactgcc tctgcacaca catcactgac 3 720 

tgcgacccct caggctagcc cttctccact ctgggggagg aggtgggaag ggccaccagg 3 780 

cccccagctg ccaggccagc tgaccccagc cttcctggaa cagggagtct gcaggagcgc 3 84 0 

agacaggcac agccctggag caggcaggcc gagggctgcg gcactggagc aggctgactt 3 900 

acatgctcca cacgggacct gtgtcaccca atgagatgtt tgttactctg gtaaatgcca 3 960 

cacgttaaca caataacacc cattcctggg accgtgggga tttagggcac c*';caccgcag 4020 

acacgccctg cagcattcac cgacagtctg tcatgcaccc accacgttgg ccacgtcctit 4080 

gcgttcctat cggatgctcc cagtaaccag ggggaccacc cgagctaatc acggaatgtc 4140 

cgttcccagc aaacacgata aagaaagatt gtgcacttta acctctctca tcagggccca 4200 

agggctggct gggatttttc tttttttttt ttcccactaa ctttgttcct gaccaaagtg 4260 

aattggaggc attctgctaa aagacatccc cgtagacata ggggagagag ttgctggctg 4 3 20 

agggcttccc ttggcttcca gaaggcagcc ttccatccag acaagccagt gagctctccc 4 3 80 

cttgggatca ctggggtgac cagtcagcag attgatnctc attcataaga tcattcctcc 4 44 0 

ctttaaattg agcccctaag agcactggcc tgggagtcag acagacctgg gttcaagtcc 4 500 

ccagtcccct gcccaccccc tacgcgacct tgatcaggtc actagtgcct ctctgagcct ' 4 560 

cagtttcccc cctgtaactt ggggttgaac taaaacacct gtcccgccta cctcacaagg 4620 

ccactctgag gattgaaact tgatcttgtc caggaaagct tcgtaccaaa cagcgaagcc 4680 

gccccgatcc gtgaggtatg agtatgactc tgaccttcag ccctccctac agccgggggt 4 74 0 

gtggcccaga gaagcttcca gcacagccct ctacccagaa catccgggct ggagggaggc 4 800 

tcccagtgac ttttctgaca tccctagaca ggttcactct tcgctcaaga aaggcccgaa 4 860 

tgacaatgtc caggatgtct gcacaaccga gcagcccgct cactccccaa agaaacctat 4 920 

tggcagcttc aacaggcagg caataatctc ttcccagaac cactgcagtc aggaataaac 4 980 

tgttttctcc accaggcttt gacaaaaggg cccacaggaa tcttaccaat gccaacattt 5040 

caaagcaccc tatttcacgt agcatagctt tctgctcccc ttccccaaag agaggttatg 5100 

gaggtactgt agcttttagg gaaaaaaaaa atgttaacac atcacaggcc aagttgaagt 516 0 

cattctctgt ctaggcacta aaaatcggcg ttgtcactca ctgtgtatta ccagtattta 5220 

cttgctttct tgacttcacc aaaaccaaat ctaatttaaa ggaccacact aatttttcaa 5280 

agggaaagag acaattaatt gtacataatg tatacacaca caaaaaaaaa aatacctgna 534 0 

gaaatattat tccagcatag caggaaaaca aacaaaagta ttggactgtc ggaggtgagc 54 00 

ccgtgcgtct gtaacccttt gtgactcctg agcgtgcgct gtcttctagg ttaactcacg 5460 

aagtacaccc tctgtcttac cgatactgta ggttcaccca ttttttttta atttcctcgc 5520 

aaataacaag acccacagaa gtgactccag ctactcaatg gttctgttct cttatatgca 5580 

gcaaacacac cgcccatttc tgaagaggct tcggcctgaa ggcattttcc aatgatgtta 5640 

gtgcacaaaa cgctttaaat tagactggaa ctgccagaat caaatgcaaa cgaggaactt 5700 

ctcgtacccc tactgcatgg tatcgatttt caataaattg ctgcaaattt gtttttatga 5760 

ataaaaggaa aaaacctgtc gtcttt 5786 



<210> 2 
<211> 5459 
<212> DNA 

<213> Rattus norvegicus 



<400> 2 

ggccgttctg agcgagcaga agccggagcc gagccggggc ccgtgccggc gccattgcgc 60 

gggrgccgcg ggcaaakctc ggcgccgggc ggcgggccgg gccaggccat gcgggccgag 120 

tgagctggcg cccgcagccc gcggcgcggc atggcttccc cgccgagccc cgggcagccc 180 

cggccgccgc cgccgccgcc gccgcccgcg cgcctgctgc tgcccctgct gctgtcgctg 240 

ctgccgtggt tggcgcccgg ggcctggggc tggacgcggg gcgccccccg gccgccgccc 3 00 

agcagcccgc cgctctccat cacgggcccc atgccgctca ccaaggaggt ggccaagggc 3 50 

agcaccgggc gcggcgtgct ccccgccgtg gagctagcca tcgagcagat ccgcaacgag 4 20 

tcacccctgc gcccctactt cccggacccg cgactctacg acaccgagtg tgacaatgca 480 

aagggactga aagccttcta tgacgcaata aagcacgggc cgaaccattt gar.ggtgttt 54 0 

ggaggcgtct gtccgtctgt cacatctatt atcgcggagt ccctccaagg ctggaatctg 600 

gtgcagcttt ccttcgccgc caccacgcct gttcttgcgg ataagaagaa gtacccgtat 660 

ttcttccgga cggtgccgtc agacaacgcg gtgaaccccg ccaccctgaa gctcctgaag 720 

cacttccgct ggcggcgcgt gggcacactc acgcaggacg ygcagcgctt ctccgaggtg 78 0 
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aggaatgacc tgactggggt tctgtatggg gaagatattg agatctcaga cacagagagt 84 0 

ttctccaatg atccctgcac cagcgtcaaa aagctcaagg ggaatgacgt gcggatcatc 9C0 

cttggccagt ttgaccagaa tatggcagca aaagtcttct gttgtgcctt cgaggagagc 960 

atgtttggca gcaagtacca gtggatcatc ccgggatggt acgagcctgc gtggtgggag 1020 

caggtgcatg tggaggccaa ttcctcacgc tgcctgcgca gaagcctcct ggctgccatg 10 8 0 

gaaggttaca tcggagtgga ctttgagccc ctgagctcca aacaaatcaa gacccitctca 114 0 

gggaagactc cacagcagtt tgaaagagag tacaacagca aacgttcagg cgtggggccc 12 00 

agcaagttcc atgggtacgc ctacgatggg atctgggtca tcgccaagac cctacagagg 126 0 

gccatggaga cactgcatgc cagtagcagg caccagcgga tccaggactt caactacaca 132 0 

gaccacacgc tgggcaaaat catcctcaat gccatgaacg agaccaactt cttcggggtc 13 8 0 

acgggtcaag ttgtgttccg gaacggggag agaatgggaa ccattaaatt tactcaattt 144 0 

caagacagca gagaggtgaa ggtcggcgaa tacaacgcgg tggctgacac actggagatc 1500 

atcaatgaca ccataaggtt ccaggggtcc gagccaccca aggacaagac catcattctg 1560 

gagcagcttc ggaagatctc gcttccactg tatagcatcc tgtccgctct caccatcctc 1620 

ggcatgatca tggccagcgc cttcctcttc ttcaacatca agaaccggaa ccaaaagctg 1680 

attaagatgt caagccccta catgaacaac ctcatcatcc tgggaggaat gctgtcctat 1740 

gcatccatct tcctctttgg cctcgacggg tccttcgtct cagaaaagac ctttgaaaca 1800 

ctctgcacgg tccggacctg gattctcacc gtgggctaca caactgcctt tggggccatg 1860 

tttgcaaaga cctggagggt ccatgccatc ttcaaaaatg tgaagatgaa gaagaagatc 1920 

atcaaagacc agaagctgct tgtgattgtg gggggcatgc tgctcatcga cctgtgcatc 1980 

ctgatctgtt ggcaggctgt ggaccccctg cggaggacag tggagaggta cagcatggag 2 04 0 

ccggacccag caggccggga catctccatc cgcccattgc tggaacactg cgaaaacacc 2100 

cacatgacca tctggcttgg cattgtctac gcctacaagg ggctcctcat gctattcggt 2160 

tgtttcttgg catgggaaac ccgcaatgtg agcatccctg ccctcaacga cagcaagtac 2220 

atcggcatga gtgtgtacaa tgtggggatc atgtgcatca tcggggctgc tgtctccttc 2280 

ctgacgcgtg accagcccaa cgtgcagttc tgcatcgtgg ccctggtcat catcttctgc 2340 

agcaccatca ctctctgcct ggtgtttgfcg ccaaagctca tcactctgag gacaaaccct 2400 

gacgcagcca ctcagaacag gcggttccag ttcacacaga accagaagaa agaagattcg 2460 

aagacctcca cttcagtcac cagcgtgaac caggcgagca cgtcacgcct 99 a 999actg 2520 

cagtcagaaa accaccgcct tcgaatgaag atcacagagc tggacaaaga cttggaagaa 2580 

gtcaccatgc agctacaaga cacaccagag aagaccacat acatcaaaca gaatcactac 2640 

caagagctca acgacatcct cagcttgggc aacttcacag agagcacaga tggaggaaag 2700 

gccattctaa aaaatcacct cgatcaaaac ccccagctcc agtggaacac gacagagccc 2760 

tcaagaacat gcaaagaccc catagaagac atcaactccc cggagcacat ccagcgccgg 2820 

ctgtcgctcc agctccccat ccttcaccac gcctacctcc catccatcgg aggcgtggat 2880 

gccagctgcg tcagcccctg tgtcagccct accgccagcc ctcgccacag acacgtacca 2940 

ccctccttcc gagtcatggt ctcgggcctg taggggtggg aggcctgggg ccggggcctc 3 0 00 

cccgggacag caccatgctg ggccaaggcg cctgccacag gcacactgac ggcggcgaga 3060 

agctgggcac catgctgcct ctccagactg ctggaatggc gctcaggcag agcgggactc 312 0 

ggcaccgacc tcgagcctta tctgtgaagg tcttactctc acagaggaga ggaatgacaa 3180 

tgacttctcc ttcttggcgt ctgcaaacaa agaggagttg ggatgtctga aacttgcaaw 324 0 

aacaaatcaa actctagaca aaggagagag gcctcggact cctgctgtcc tcgccaagtg 3300 

gccagagcaa gggctctgca gaaatggccc tcggcactct gcacacacgt catcggtggt 33 6 0 

gcccgccccc acacaccagc ccttctccac tctgggggag aaggtgagca gggccaccag 3420 

gtccccagca gctggcccag tccacttcag acttcctgga acaagggagt cagcaggaag 34 8 0 

ggtagccggg ctctgccctg gggctggggg aatcagagca agcagctgcg catgctccca 3 54 0 

ggggacatcc atcagaagag agatgtttgt tactctctga taaatgccac acattaacac 3 600 

aataacaccc gttccgggga ctgtggggat tcagggcacc gcactgcaga catgctctgc 3 660 

aggatgcaac agacagtctg tcatgcacct accgtgtcgg ccatgttctt gtgttcaaat 3720 

ttagggttcc cggtaataag gggaaccatc cgagctaatc atggaatgtc Tigctcccaga 3780 

aaacacgata aagaaagatt gtgcacttta acctctccca tgggggccca agggctgctg 384 0 

ggattccccc tctcccaaca cactgacttt gtttttgacc aaagtgaatc aggggcattc 3900 

tgctaaaagt catctctgtg ttcctgagct agagctgctg gcccagggcc ccctggcttc 3 960 

cagaaggtag ccctccagtg cccctccagc cctaacccgt gagtgcccac ctagggtgtc 4 02 0 

acctgaggcg gtcartcagc aggctggttc ccatccatca cgacatactt tccttgtaat 4 080 

tgaaccccaa agagcactgg cctgggagtc agacctggat ttgagtcctc agtcccttca 4140 

gactctctcc gtgaccttgg gcaagtcacc agtgtctctc tgagcctcgg tttcccctct 4200 

gtaacagact ggggttgaaa taacacccat cctgcctacc tcacagggtc ar:tctgagga 4260 

tggacacttg atctcattca tccaggagag ctttgcactg ggaagcagct ccgacctgtg 4320 

aggtattagt acgaccatga ccttcagccc tcccaccagc tggggagggt gcggtgcggc 4 3 80 

ccagagaaac ttcctgcaag gccctctacc tagggacagg aacagcccag ticagaggcag 4 44 0 

gctcaggact cttctgacat tccttaatag gttcattctt tgctaaaggc ctgaatggca 4500 

ccrtccaggg gcgtctgcac aaacgagtgg tttgctcact ccctaaagaa tcctaaaggc 4560 

agcttcaaca ggcaggcaat aatctcttcc cagaccactg cagtcaggaa cacactgctg 4620 

; tcaaccacca ggctgtgacg aaagggccca cgggaatcac catcgccaac atttcaaaac 4680 

accctagttc acgtagcata gctcttcctc cccatccctc aaagagaggt tacagaggtt 4740 

ctgtagcttt taagggggga aaaatgtcaa cacatcacag gtcaagttga tgtccttctc 4800 

tgtttaggca ccaaaactcc gtgttgtcac tcactgtgtg tgttaaccag tatttccttg 4860 

ctttttcgac ttcaccaaaa ccaaatttca tttaaaagac cgyattagtt tttttcaaag 4920 

ggaaagagat aattaactgt acataatgta tacagacaaa acaaaacctg tagaaatatt 4980 

actccagcat agcaggaaaa aaaaaaaaac aaaaaacaaa aaacaaaagt attggattgt 504 0 

cggaggtgag cgtgcgtctg taatcccttg tgactccgaa ctgtgctccg tcttctaggt 5100 
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taacccacaa gatacatccc tgtctactga tcctgtaggt tcacccattt tctttcaatt 
tctctgcaat taacaagccc cacaaagtga ttctggctat ccaccggttc tgttctttta 
tatgcagcaa acacacggcc catttcctag aggcttccgc ccaagggcac cttccagtga 
cgctagtgcg cacacacaca cacacacaca cacacacaca cacacacaca cacacacgac 
tgctttcaat tagaccggaa ctggcagaat caaatgtaaa tgaggagttt ctctcgcccc 
actgcatggt atcgattttt aataaattgt tgcaaatttg tttttatgaa taacagggg 



5160 
5220 
5280 
5340 
5400 
5459 



<210> 3 
<211> 941 
<212> PRT 

<213> Homo sapiens 
<400> 3 



Met Ala Ser Pro Arg Ser Ser Gly Gin Pro Gly Pro Pro Pro Pro Pro 

15 10 is 

Pro Pro Pro Pro Ala Arg Leu Leu Leu Leu Leu Leu Leu Pro Leu Leu 

20 25 30 

Leu Pro Leu Ala Pro Gly Ala Trp Gly Trp Ala Arg Gly Ala Pro Arg 

35 40 45 

Pro Pro Pro Ser Ser Pro Pro Leu Ser He Met Gly Leu Met Pro Leu 

50 55 60 

Thr Lys Glu Val Ala Lys Gly Ser He Gly Arg Gly Val Leu Pro Ala 
65 70 75 80 

Val Glu Leu Ala He Glu Gin He Arg Asn Glu Ser Leu Leu Arg Pro 

85 90 95 

Tyr Phe Leu Asp Leu Arg Leu Tyr Asp Thr Glu Cys Asp Asn Ala Lys 

100 105 no 

Gly Leu Lys Ala Phe Tyr Asp Ala He Lys Tyr Gly Pro Asn His Leu 

115 120 125 

Met Val Phe Gly Gly Val Cys Pro Ser Val Thr Ser He He Ala Glu 

130 135 140 

Ser Leu Gin Gly Trp Asn Leu Val Gin Leu Ser Phe Ala Ala Thr Thr 
145 150 155 160 

Pro Val Leu Ala Asp Lys Lys Lys Tyr Pro Tyr Phe Phe Arg Thr Va 1 

1S5 170 175 

Pro Ser Asp Asn Ala Val Aan Pro Ala He Leu Lys Leu Leu Lys His 

180 185 190 

Tyr Gin Trp Lys Arg Val Gly Thr Leu Thr Gin Asp Val Gin Arg Phe 

195 200 205 

Ser Glu Val Arg Asn Asp Leu Thr Gly Val Leu Tyr Gly Glu Asp He 

210 215 220 

Glu He Ser Asp Thr Glu Ser Phe Ser Asn Asp Pro Cys Thr Ser Val 
225 230 235 240 

Lys Lys Leu Lys Gly Asn' Asp Val Arg. He He Leu Gly Gin Phe Asp 

245 250 255 

Gin Asn Met Ala Ala Lys Val Phe Cys Cys Ala Tyr Glu Glu Asn Met 

260 265 270 

Tyr Gly Ser Lys Tyr Gin Trp He He Pro Gly Trp' Tyr Glu Pro Ser 

275 280 285 

Trp Trp Glu Gin Val His Thr Glu Ala Asn Ser Ser Arg Cys Leu Arg 

290 295 * 300 

Lys Asn Leu Leu Ala Ala Met Glu Gly Tyr He Gly Val Asp Phe Glu 
305 310 315 320 

Pro Leu Ser Ser Lys Gin He Lys Thr He Ser Gly Lys Thr Pro Gin 

325 330 335 

Gin Tyr Glu Arg Glu Tyr Asn Asn Lys Arg Ser Gly Val Gly Pro Ser 

340 345 350 

Lys Phe His Gly Tyr Ala Tyr Asp Gly He Trp Val He Ala Lys Thr 

355 360 ' 365 

Leu Gin Arg Ala Met Glu Thr Leu His Ala Ser Ser Arg Kis Gin Arg 

370 375 380 

He Gin Asp Phe Asn Tyr Thr Asp His Thr Leu Gly Arg He He Leu 
385 390 395 400 

Asn- Ala Met Asn Glu Thr Asn Phe Phe Gly Val Thr Gly Gin Val Val 

405 410 415 

Phe Arg Asn Gly Glu Arg Met Gly Thr He Lys Phe Thr Gin Phe Gin 

420 425 430 

Asp Ser Arg Glu Val Lys Val Gly Glu Tyr Asn Ala Val Ala Asp Thr 

435 440 445 

Leu Glu He He Asn Asp Thr He Arg Phe Gin Gly Ser Glu Pro Pro 
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450 455 460 

Lys Asp Lys Thr lie lie Leu Glu Gin Leu Arg Lvs lie Ser Leu Pro 
465 470 475 " 480 

Leu Tyr Ser lie Leu Ser Ala Leu Thr lie Leu Gly Met lie Met Ala 

485 490 495 

Ser Ala Phe Leu Phe Phe Asn lie Lys Asn Arg Asn Gin Lys Leu lie 

500 505 510 

Lys Mec Ser Ser Pro Tyr Met Asn Asn Leu He He Leu Gly Gly Met 

515 520 525 

Leu Ser Tyr Ala Ser He Phe Leu Phe Gly Leu Asp Gly Ser Phe Val 

530 535 540 

Ser Glu Lys Thr Phe Glu Thr Leu Cys Thr Val Arg Thr Trp He Leu 
545 5S0 555 S 60 

Thr Val Gly Tyr Thr Thr Ala Phe Gly Ala Met Phe Ala Lys Thr Trp 

SSS 570 575 

Arg Val His Ala He Phe Lys Asn Val Lys Met Lys Lys Lys He He 

580 585 590 

.Lys Asp Gin Lys Leu Leu Val He Val Gly Gly Met Leu Leu He Asp 

595 600 605 

Leu Cys lie Leu He Cys Trp Gin Ala Val Asp Pro Leu Arg Arg Thr 

610 615 620 

Val Glu Lys Tyr Ser Met Glu Pro Asp Pro Ala Gly Arg Asp He Ser 
625 630 635 64C 

He Arg Pro Leu Leu Glu His Cys Glu Asn Thr His Met Thr He Trp 

645 550 655 

Leu Gly He Val Tyr Ala Tyr Lys Gly Leu Leu Met Leu Phe Gly Cys 

660 6S5 670 

Phe Leu Ala Trp Glu Thr Arg Asn Val Ser He" Pro Ala Leu Asn Asp 

675 680 685 

Ser Lys Tyr lie Gly Met Ser Val Tyr Asn Val Gly He Met Cys He 

690 695 700 

lie Gly Ala Ala Val Ser Phe Leu Thr Arg Asp Gin Pro Asn Val Gin 
705 710 715 720 

Phe Cys lie Val Ala Leu Val He He Phe Cy3 Ser Thr He Thr Leu 

725 730 735 

Cys Leu Val Phe Val. Pro Lys Leu He Thr Leu Arg Thr Asn Pro Asp 

740 745 750 

Ala Ala Thr Gin Asn Arg Arg Phe Gin Phe Thr Gin Asn Gin Lys Lys 

755 760 765 

Glu Asp Ser Lys Thr Ser Thr Ser Val Thr Ser Val Asn Gin Ala Ser 

770 775 780 

Thr Ser Arg Leu Glu Gly Leu Gin Ser Glu Asn His Arg Leu Arg Met 
785 790 795 300 

Lys He Thr Glu Leu Asp Lys Asp Leu Glu Glu Val Thr Met Gin Leu 

805 810 . 815 

Gin Asp Thr Pro Glu Lys Thr Thr Tyr He Lys Gin Asn His Tyr Gin 

820 82S 830 

Glu Leu Asn Asp He Leu Asn Leu Gly Asn Phe Thr Glu Ser Thr Asp 

835 840 345 

Gly Gly Lys Ala He Leu Lys Asn His Leu Asp Gin Asn Pro Gin Leu 

850 855 860 

Gin Trp Asn Thr Thr Glu Pro Ser Arg Thr Cys Lys Asp Pro He Glu 
865 870 875 880 

Asp He Asn Ser Pro Glu His He Gin Arg Arg Leu Ser Leu Gin Leu 

885 890 895 

Pro He Leu His His Ala Tyr Leu Pro Ser He Gly Gly Val Asp Ala 

900 905 910 

Ser Cys Val Ser Pro Cys Val Ser P ro Thr Ala Ser Pro Arg His Arg 

915 920 925 

His Val Pro Pro Ser Phe Arg Val Met Val Ser Gly Leu 
930 935 940 

<210> 4 
<211> 940 
<212> PRT 

<213> Rattus norvegicus 
<400> 4 

Met Ala Ser Pro Pro Ser Ser Gly Gin Pro Arq Pro Pre Pro Pro Pro 

1 5 10 15 

Pro Pro Pro Ala Arg Leu Leu Leu Pro Leu Leu Leu Ser Leu Leu Leu 
20 25 30 
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Trp Leu Ala Pro 

35 

Pro Pro Ssr Ser 
50 

Lys Glu Val Ala 
65 

Glu Leu Ala He 

Phe Leu Asp Leu 
100 

Leu Lys Ala Phe 
115 

Val Phe Gly Gly 
130 

Leu Gin Gly Trp 
145 

Val Leu Ala Asp 

Ser Asp Asn Ala 
180 

Arg Tro Arg Arg 
195 

Glu Val Arg Asn 
210 

He Ser Asp Thr 
225 

Lys Leu Lys Gly 

Asn Met Ala Ala 
260 

Gly Ser Lys Tyr 
275 

Trp Glu Gin Val 
290 

Ser Leu Leu Ala 
305 

Leu Ser Ser Lys 

Phe Glu Arg Glu 
340 

Phe His Gly Tyr 
355 

Gin Arg Ala Met 
370 

Gin Asp Phe Asn 
385 

Ala Met Asn Glu 

Arg Asn Gly Glu 
420 

Ser Arg Glu Val 
435 

Glu He He Asn 
450 

Asp Lys Thr He 
465 

Tyr Ser He Leu 

Ala Phe Leu Phe 
500 

Met Ser Ser Pro 
515 

Ser Tyr Ala Ser 
530 

Glu Lys Thr Phe 
545 

Val Gly Tyr Thr 

Val His Ala He 
580 

Asp Gin Lys Leu 
595 



Gly Ala Trp Gly 
40 

Pro Pro Leu Ser 
55 

Lys Gly Ser He 
70 

Glu Gin He Arg 
85 

Arg Leu Tyr Asp 

Tyr Asp Ala He 
120 

Val Cys Pro Ser 
135 

Asn Leu Val Gin 
150 

Lys Lys Lys Tyr 
165 

Val Asn Pro Ala 

Val Gly Thr Leu 
200 

Asp Leu Thr Gly 
215 

Glu Ser Phe Ser 
230 

Asn Asp Val Arg 
245 

Lys Val Phe Cys 

Gin Trp He He 
280 

His Val Glu Ala 
295 

Ala Met Glu Gly 
310 

Gin He Lys Thr 
325 

Tyr Asn Ser Lys 

Ala Tyr Asp Gly 
360 

Glu Thr Leu His 
375 

Tyr Thr Asp His 
390 

Thr Asn Phe Phe 
405 

Arg Met Gly Thr 

Lys Val Gly Glu 
440 

Asp Thr He Arg 
455 

He Leu Glu Gin 
470 

Ser Ala Leu Thr 
485 

Phe Asn He Lys 

Tyr Met Asn Asn 
520 

He Phe Leu Phe 
535 

Glu Thr Leu Cys 
550 

Thr Ala Phe Gly 
565 

Phe Lys Asn Val 

Leu Val He Val 
600 



Trp Thr Arg Gly 

He Met Gly Leu 
60 

Gly Arg Gly Val 

• 75 ' 
Asn Glu Ser Leu 
90 

Thr Glu Cys Asp 
105 

Lys Tyr Gly Pro 

Val Thr Ser He 
140 

Leu Ser Phe Ala 

155 

Pro Tyr Phe Phe 
170 

He Leu Lys Leu 
185 

Thr Gin Asp Xaa 

Val Leu Tyr Gly 
220 

Asn Asp Pro Cys 
235 

He He Leu Gly 
250 

Cys Ala Phe Glu 
265 

Pro Gly Trp Tyr 

Asn Ser Ser Arg 
300 

Tyr He Gly Val 
315 

He Ser Gly Lys 
330 

Arg Ser Gly Val 
345 

He Trp Val He 

Ala Ser Ser Arg 
380 

Thr Leu Gly Lys 
395 

Gly Val Thr Gly 
410 

He Lys Phe Thr 
425 

Tyr Asn Ala Val 

Phe Gin Gly Ser 
460 

Leu Arg Lys He 
475 

He Leu Gly Met 
490 

Asn Arg Asn Gin 
505 

Leu He He Leu 

Gly Leu Asp Gly 
540 

Thr Val Arg Thr 
555 

Ala Met Phe Ala 
570 

Lys Met Lys Lys 
585 

Gly Gly Met Leu 



Ala Pro Arg Pro 
45 

Mec Pro Leu Thr 

Leu Pro Ala Val 
80 

Leu Arg Pro Tyr 
95 

Asn Ala Lys Gly 
110 

Asn His Leu Met 
125 

He Ala Glu Ser 

Ala Thr Thr Pro 
160 

Arg Th.c Val Pro 
175 

Leu Lys His Phe 
190 

Gin Arg Phe Ser 
205 

Glu Asp He Glu 

Thr Ser Val Lys 
240 

Gin Phe Asp Gin 
255 

Glu Ser Met Phe 
270 

Glu Pro Ala Trp 
285 

Cys Leu Arg Arg 

Asp Phe Glu Pro 
320 

Thr Pro Gin Gin 
335 

Gly Pro Ser Lys 
350 

Ala Lys Thr Leu 
365 

His Gin Arg He 

He He Leu Asn 
400 

Gin Val Val Phe 
415 

Gin Phe Gin Asp 
430 

Ala Asp Thr Leu 
445 

Glu Pro Pro Lys- 

Ser Leu Pro Leu 
480 

He Met Ala Ser 
495 

Lys Leu He Lys 
510 

Gly Gly Met Leu 
525 

Ser Phe Val Ser 

Trp He Leu Thr 
560 

Lys Thr Trp Arg 
575 

Lys He He Lys 
590 

Leu He Asp Leu 
605 
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Cys lie Leu He Cys Trp Gin Ala Val Asp Pro Leu Axg Arg Thr Val 

610 615 620 

Glu Arg Tyr Ser Met Glu Pro Asp Pro Ala Gly Arg Asp lie Ser He 
625 630 635 640 

Arg Pro Leu Leu Glu His Cys Glu Asn Thr His Met Thr He Trp Leu 

645 650 655 

Gly He Val Tyr Ala Tyr Lys Gly Leu Leu Met Leu Phe Gly Cys Phe 

660 665 670 

Leu Ala Trp Glu Thr Arg Asn Val Ser He Pro Ala Leu Asn Asp Ser 

675 680 685 

Lys Tyr He Gly Met Ser Val Tyr Asn Val Gly lie Met Cys He lie 

690 695 700 

Gly Ala Ala Val Ser Phe Leu Thr Arg Asp Gin Pro Asn Val Gin Phe 
70S 710 715 720 

Cys He Val Ala Leu Val He He Phe Cys Ser Thr He Thr Leu Cys 

725 730 735 

Leu Val Phe Val Pro Lys Leu He Thr Leu Arg Thr Asn Pro Asp Ala 

740. 745 750 

Ala Thr Gin Asn Arg Arg Phe Gin Phe Thr Gin Asn Gin Lys Lys Glu 

755 760 765 

Asp Ser Lys Thr Ser Thr Ser Val Thr Ser Val Asn Gin Ala Ser Thr 

770 775 780 

Ser Arg Leu Glu Gly Leu Gin Ser Glu Asn His Arg Leu Arg Met Lys 
785 790 795 800 

He Thr Glu Leu Asp Lys Asp Leu Glu Glu Val Thr Met Gin Leu Gin 

805 810 815 

Asp Thr Pro Glu Lys Thr Thr Tyr He Lys Gin Asn His Tyr Gin Glu 

820 825 830 

Leu Asn Aso He Leu Ser Leu Gly Asn Phe Thr Glu Ser Thr Asp Gly 

835 840 845 

Gly Lys Ala He Leu Lys Asn His Leu Asp Gin Asn Pro Gin Leu Gin 

850 855 860 

Trp Asn Thr Thr Glu Pro Ser Arg Thr Cys Lys Asp Pro He Glu Asp 
865 870 875 880 

He Asn Ser Pro Glu His He Gin Arg Arg Leu Ser Leu Glr Leu Pro 

885 890 895 

He Leu His -His Ala Tyr Leu Pro Ser He Gly Gly Val Asp Ala Ser 

900 905 910 

Cys Val Ser Pro Cys Val Ser Pro Thr Ala Ser Pro Arg His Arg His 

915 920 925 

Val Pro Pro Ser Phe Arg Val Met Val Ser Gly Leu 
930 935 940 

<210> 5 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
c400> 5 

gcgggatccc ttcggcacga ataccaggca gagggt $q 

<210> 6 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<400> 6 

ccggaattcc agaagctcat aaagatgtcg agtcc 35 

<210> 7 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
c400> 7 

ccggaattct tcttagcttt gggagacccg caacgt 35 

<210> 8 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
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c400> 8 

agcactagaa ctccagctgg aagtca 

<210> 9 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<400> 9 

gcctacgatg gcatctgggt catcgc 

<210> 10 
<211> 27 
c212> DNA 

c213> Artificial Sequence 

<40O> 10 
cagattccag ccttggaggg accctgc 

<210> 11 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<400> 11 

tgcatcgtag aaggctttca acccttttgc 

<210> 12 
<211> 25 
<212> DNA 

<213> Artificial Sequence 

<400> 12 
aataaggctc gaggtcaggt gccaa 

<210> 13 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<400> 13 
cagcagcccg ccgctctcca teat 

<210> 14 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<400> 14 

gegegtaata cgactcacta taggggtagt tgaagtcctg gatccg 

<210> 15 
c211> 25 
<212> DNA 

<213> Artificial Sequence 

<400> 15 
cgctggtgca gggatcattg gagaa 

<210> 16 
<211> 25 
<212> DNA 

c213> Artificial Sequence 

<400> 16 
catattctgg tcaaactggc caagg 

<210> 17 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
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26 



26 



27 



30 



25 



24 



46 



25 
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<400> 17 

ccggaattcc caccatgggc ccggggggac cctgtacc 3 8 

<210> 18 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<40O> 18 

ctagtctaga tcacttgtaa agcaaatgta cucgact 37 

<210> 19 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<400> 19 

gagtagttcg tacaagtact tggtggc 2 7 

<210> 20 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<40O> 20 

ccggaattcc caccatgctg ctgctgctgc tggtgcctct c 41 

<210> 21 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<400=» 21 

cgcgcaatta accctcacta aaggacaaca gcaaacgttc aggc 44 

<210> 22 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<400> 22 

gcgcgtaata cgactcacta tagggcatgc ctatgatggt gag 43 

<210> 23 
* <211> 44 
<212> DNA 

<213> Artificial Sequence 
<400> 23 

cgcgcaatta accctcacta aaggctgagg acaaaccctg acgc 44 

<210> 24 
<211> 43 
c212> DNA 

<213i Artificial Sequence 
<400> 24 

gcgcgtaata cgactcacta taggggatgt cttctatggg gtc 43 

<210> 25 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<400> 25 

cgcgcaatta accctcacta aaggaagctt atccaccacg ac 42 

<210> 26 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
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<400> 26 

gcgcgtaata cgactcacta tagggagctg gatccgagaa gaa 

<210> 27 
<211> 3399 
<212> DNA 

<213> Homo sapiens 



43 



<400 
gccgcccctg 
ccgagctcca 

gggggcgggc 

acgccggccc 

cccacccctt 

gccgctctga 

attgcgcggg 

ggccgagtga 

gcagcccggg 

gctgccgctg 

gccgccgccc 

ggccaagggc 

ccgcaacgag 

cgacaacgca 

gatggtgttt 

ctggaatctg 

atacccttat 

gttgctcaag 

ctctgaggtg 

caccgagagc 

gcggatcacc 

cgaggagaac 

ttggcgggag 

tgctgccatg 

gaccatctca 

cgtggggccc 

actgcagagg 

caactacacg 

cttcggggtc 

tactcaattt 

actggagacc 

catcatcctg 

caccatcctc 

tcagaagctc 

gctctcctac 

ctttgaaaca 

tggggccatg 

gaagaagacc 

cctgtgtatc 

cagcacggag 

tgagaacacc 

gttgttcggt 

cagcaagtac 

tgtctccttc 

catcttctgc 

aacaaaccca 

agaagattct 

ggagggccta 

cttggaagag 

gaaccactac 

tggaggaaag 

aacagagccc 

ccagcgtcgg 

ccgagtcatg 

cagaaccaca 

agctgggcac 

gggcaggggg 



> 27 
agcagcctcg 
ggggctgccg 
ggaggcggcc 
ccctctccgc 
cccggcgtga 
gccgagccgg 
cgccgcggga 
gccggcgccc 
ccgccgccgc 
ctgctgcctc 
agcagcccgc 
agcatcgggc 
ccactcccgc 
aaagggttga 
ggaggcgtct 
gtgcagcttt 
ttctttcgga 
caccaccagt 
cggaatgacc 
ttccccaacg 
cttggccagc 
atgtatggta 
caggcgcaca 
gagggctaca 
ggaaagactc 
agcaagttcc 
gccatggaga 
gaccacacgc 
acgggtcaag 
caagacagca 
atcaatgaca 
gagcagctgc 
gggatgatca 
ataaagatgt 
gcttccatat 
ctttgcaccg 
tttgcaaaga 
atcaaggacc 
ccgatctgct 
ccggacccag 
catatgacca 
tgtttcttag 
atcgggatga 
ctgacccggg 
agcaccatca 
gatgcagcaa 
aaaacgccca 
cagtcagaaa 
gtcaccatgc 
caagagccca 
gccacttcaa 
tctcgaacat 
ccgtccctcc 
gcctcgggcc 
ccgggcagag 
catggctggc 
agacttggca 



ccttcgcctc 

cctagcagcn 

cggtgcgggg 

gtttgctcgc 

ttgatccgtc 

aaccctagcc 

agaccccggc 

gcagcccgcg 

cgccgccacc 

tggcgcccgg 

cgctctccat 

gcggtgtgct 

gcccctactt 

aagcccccta 

gtccatccgc 

ctttcgctgc 

ccgccccatc 

ggaagcgcgt 

tgactggagt 

atccctgtac 

ttgaccagaa 

gtaaacatca 

cggaagccaa 

ttggcgcgga 

cacagcagta 

acgggtacgc 

cactgcatgc 

tgggcaggat 

ttgtattccg 

gggaggtgaa 

ccatcaggtt 

ggaagatctc 

tggccagtgc 

cgagtccata 

ttctctctgg 

tcaggacctg 

cctggagagt 

agaaactgct 

ggcaggctgt 

caggacggga 

tctggcttgg 

cttgggagac 

gtgtccacaa 

accagcccaa 

ccctctgcct 

cgcagaacag 

ccccggtcac 

accatcgcct 

agctgcagga 

atgacatcct 

aaaatcacct 

gcaaagatcc 

agctccccat 

tgcaagggcg 

gggtctgccg 

ctctcaggac 

cctgacctcg 



ccgcgtttcc 

cc cggcggga 

cggccgcgct 

ttgctccccg 

acgggcgccg 

cgagacggag 

gcggggcggc 

gcgcggcatg 

gccgcccgcg 

ggcctggggc 

catgggcctc 

ccccgccgtg 

cctcgacctg 

cgatgcaata 

cacatccacc 

aaccacgcct 

agacaatgcg 

gggcacgctg 

tctgtatggc 

cagtgtcaaa 

tatggcagca 

gtggaccact 

ctcatcccgc 

tctcgagccc 

tgagagagag 

ctacgatggc 

cagcagccgg 

catcctcaat 

gaatggggag 

ggtgggagag 

ccaaggatcc 

cctacctctc 

ttttctcctc 

catgaacaac 

ccttgatgga 

gattctcacc 

ccacgccacc 

cgtgatcgcg 

ggaccccctg 

tatctccacc 

caccgcctat 

ccgcaacgtc 

cgtggggatc 

tgtgcagttc 

ggtattcgtg 

gcgattccag 

cagtgtgaac 

gcgaatgaag 

cacaccagaa 

caacctggga 

cgaccaaaat 

catagaagat 

cctccaccac 

ggaggcctgg 

cagaaacacc 

cactcggacg 

agccttatt 



tgccgtccgc 

gagcggttca 

ggagagaggc 

cctcccgcac 

ccgctgccgc 

ccggggcccg 

gggccgggcc 

gcctccccgc 

cgcctgctac 

tgggcgcggg 

atgccgctca 

gaactggcca 

cggctctatg 

aaatacgggc 

attgcagagt 

gttctagccg 

gtgaatccag 

acgcaagacg 

gaggacattg 

aagctgaagg 

aaagtgttct 

ccgggctggt 

tgcctccgga 

ccgagctcca 

tacaacaaca 

atctgggtca 

caccagcgga 

gccatgaacg 

agaatgggga 

tacaacgctg 

gaaccaccaa 

tacagcatcc 

tccaacatca 

cttatcatcc 

tcctttgtct 

gtgggctaca 

ctcaaaaatg 

gggggcatgc 

cgaaggacag 

cgccctctcc 

gcctacaagg 

agcacccccg 

atgtgcatca 

cgcatcgtgg 

ccgaagctca 

tccactcaga 

caagccagca 

atcacagagc 

aagaccacct 

aacttcactg 

ccccagctac 

ataaactctc 

agacatgtgc 

gcccggggcc 

gccggccctg 

gcactcaggn 



cctcccccgg 

gagcgcgcac 

gcgcgcggag 

tccgctcgct 

cgccgccgcg 

ggccggcgcc 

aggccatgcg 

ggagctccgg 

cgctaccgct 

gcgccccccg 

ccaaggaggt 

tcgagcagat 

acacggagcg 

cgaaccactt 

ccctccaagg 

ataagaaaaa 

ccatcccgaa 

ctcagaggct 

agatttcaga 

ggaatgatgt 

gttgtgcata 

acgagccttc 

agaatctgct 

agcagatcaa 

agcggtcagg 

ccgccaagac 

tccaggactt 

agaccaactt 

ccattaaatt 

tggccgacac 

aagacaagac 

tctctgccct 

agaaccggaa 

- tggaggga- 

ctgaaaagac 

cgaccgcttz 

tgaaaatgaa 

tgctgatcga 

tggagaagta 

tggagcactg 

gacttctcat 

cactcaacga 

tcggggccgc 

ctctggtca- 

tcaccctgag 

atcagaagaa- 

catcccgcct 

tggacaaaga 

acaccaaaca 

agagcacaga 

agtggaacac 

cagaacacat 

cacccccctc 

tcccccgcga 

gctgcggaga 

ggacaggacg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
216C 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2860 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3399 



<210> 28 
<211> 3396 
<212> DNA 

<213> Homo sapiens 
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<400> 28 

gccgcccctg agcagcctcg ccttcgcctc ccgcgtttcc tgccgtccgc cctcccccgg 60 

ccgagctcca ggggctgccg cctagcagct cccggcggga gagcggttca gagcgcgcac 120 

gggggcgggc ggaggcggcc cggtgcgggg cggccgcgct ggagagaggc gcgcgcggag 180 

acgccggccc ccctctccgc gtttgctcgc ttgctccccg ccccccgcac tccgctcgct 240 

cccacccctt cccggcgtga ttgatccgtc acgggcgccg ccgctgccgc cgccgccgcg 3 00 

gccgttctga gccgagccgg aaccctagcc cgagacggag ccggggcccg ggccggcgcc 360 

attgcgcggg cgccgcggga agaccttggc gcggggcggc gggccgggcc aggccatgcg 420 

ggccgagtga gccggcgccc gcagcccgcg gcgcggcatg gcctccccgc ggagctccgg 4 80 

gcagcccggg ccgccgccgc cgccgccacc gccgcccgcg cgcctgctac cgctactgct 54 0 

gctgccgctg ctgctgcctc tggcgcccgg ggcctggggc tgggcgcggg gcgccccccg 600 

gccgccgccc agcagcccgc cgctctccat catgggcctc atgccgctca ccaaggaggt 660 

ggccaagggc agcatcgggc gcggtgtgct ccccgccgtg gaactggcca tcgagcagat 720 

ccgcaacgag tcactcctgc gcccctactt . cctcgacctg cggctctatg acacggagtg 780 

cgacaacgca aaagggttga aagccctcta cgatgcaata aaatacgggc cgaaccactt 84 0 

gatggtgttt ggaggcgtcc gcccatccgt cacatccatc atcgcagagt ccctccaagg 900 

ctggaacccg gtgcagcttt ccttcgctgc aaccacgcct gttctagccg ataagaaaaa 960 

atacccttat ttctttcgga ccgtcccatc agacaacgcg gtgaatccag ccattctgaa 1020 

gttgctcaag cactaccagt ggaagcgcgt gggcacgctg acgcaagacg ttcagaggtt 1080 

ctctgaggtg cggaatgacc tgactggagt tctgtatggc gaggacatcg ugatttcaga 1140 

caccgagagc ttctccaacg atccctgtac cagtgtcaaa aagctgaagg ggaatgatgt 12 00 

gcggatcatc cttggccagt ttgaccagaa tatggcagca aaagtgttct gttgcgcata 1260 

cgaggagaac atgcatggta gtaaacacca gtggatcatt ccgggctggt acgagc'cttc 13 20 

ttggtgggag caggtgcaca cggaagccaa ctcatcccgc cgcctccgga agaatctgct 13 80 

tgctgccatg gagggctaca ttggcgtgga tttcgagccc ccgagctcca agcagatcaa 1440 

gaccatctca ggaaagactc cacagcagta tgagagagag tacaacaaca agcggtcagg 1500 

cgtggggccc agcaagttcc acgggtacgc ctacgatggc atctgggtca tcgccaagac 1560 

actgcagagg gccatggaga cactgcatgc cagcagccgg caccagcgga tccaggactt 162 0 

caactacacg gaccacacgc tgggcaggat catcctcaat gccacgaacg agaccaactt 16 80 

cttcggggtc acgggtcaag ttgtattccg gaatggggag agaatgggga ccaccaaatc 1740 

tactcaatct caagacagca gggaggtgaa ggtgggagag tacaacgctg tggccgacac 18 00 

actggagatc atcaatgaca ccatcaggtt ccaaggatcc gaaccaccaa aagacaagac 1860 

caccaccccg gagcagctgc ggaagatccc cctacctctc tacagcatcc tctctgcccc 1920 

caccatcctc gggatgatca tggccagtgc ttttctcttc ttcaacatca agaaccggaa 1980 

tcagaagccc ataaagatgt cgagtccata catgaacaac cttatcatcc ttggagggat 204 0 

gctctcctat gctcccatat ttctctttgg ccttgatgga tcctttgtct ctgaaaagac 2100 

ctttgaaaca ctttgcaccg tcaggacctg gattctcacc gtgggctaca cgaccgcccc 2160 

tggggccatg tttgcaaaga cctggagagt ccacgccatc ttcaaaaatg cgaaaatgaa 22 20 

gaagaagatc atcaaggacc agaaactgct tgcgaccgtg gggggcatgc tgctgatcga 22 80 

cctgtgtacc ctgatctgct ggcaggctgt ggaccccctg cgaaggacag tggagaagta 2340 

cagcatggag ccggacccag caggacggga tatctccatc cgccctctcc tggagcactg 2400 

tgagaacacc catatgacca tctggcttgg catcgcctat gcccacaagg gacttcccat 24 60 

gttgttcggt tgtttcntag ctcgggagac ccgcaacgtc agcatccccg cacccaacga 2520 

cagcaagtac atcgggacga gtgtctacaa cgtggggatc atgtgcatca ccggggccgc 25 80 

tgtctccttc ctgacccggg accagcccaa tgtgcagttc tgcatcgtgg ctctggtcat 2640 

catcttctgc agcaccatca ccctctgcct ggtatccgtg ccgaagctca tcaccccgag 2700 

aacaaaccca gatgcagcaa cgcagaacag gcgattccag ttcactcaga atcagaagaa 2760 

agaagattct aaaacgtcca cctcggtcac cagtgcgaac caagccagca catcccgcct 2820 

ggagggccta cagtcagaaa accatcgccc gcgaatgaag atcacagagc tggataaaga 2 8 80 

cttggaagag gtcaccatgc agctgcagga cacaccagaa aagaccacct acattaaaca 2940 

gaaccactac caagagctca atgacatcct caacctggga aacttcactg agagcacaga 3 000 

tggaggaaag gccattttaa aaaatcacct cgatcaaaat ccccagctac agtggaacac 3 060 

aacagagccc tctcgaacat gcaaagatcc tatagaagat ataaactctc cagaacaca-t 3120 

ccagcgtcgg ctgtccctcc agctccccat cccccaccac gcccacctcc catccatcgg 3180 

aggcgtggac gccagccgtg tcagcccctg cgtcagcccc accgccagcc cccgccacag 3 240 

aaccacactg ggcagagggg tctgctgcag aaacactgtc ggctctggct gcggagaagc 3 3 00 

tgggcaccat ggctggcctc tcaggaccac tcggatggca ctcaggtgga caggacgggg 3 3 60 

cagggggaga cttggcacct gacctcgagc etc ate 3 3 96 

<210> 29 
<211> 914 
<212> PRT 

<213> Homo sapiens 
<400> 29 

Met Ala Ser Pro Arg Ser Ser Gly Gin Pro Gly Pro Pro Pro Pro Pro 

15 10 is 

Pro Pro Pro Pro Ala Arg Leu Leu Leu Leu Leu Leu Leu Pro Leu Leu 

20 25 30 

Leu Pro Leu Ala Pro Gly Ala Tro Glv Trp Ala Arg Gly Ala Pro Arg 

35 40* * 45 

Pro Pro Pro Ser Ser Pro Pro Leu Ser lie Met Gly Leu Meu Pro Leu 
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50 55 60 

Thr Lye Glu Val Ala Lys Gly Ser lie Gly Arg Gly Val Leu Pro Ala 
65 70 75 80 

Val Glu Leu Ala lie Glu Gin He Arg Asn Glu Ser Leu Leu Arg Pro 

85 90 95 

Tyr Phe Leu Asp Arg Leu Tyr Asp Thr Glu Cys Asp Asn Ala Lys Gly 

100 105 110 

Leu Lys Ala Phe Tyr Asp Ala lie Lys Tyr Gly Pro Asn His Leu Met 

115 120 125 

Val Phe Gly Gly Val Cys Pro Ser Val Thr Ser He He Ala Glu Ser 

130 135 140 

Leu Gin Gly Trp Asn Leu Val Gin Leu Ser Phe Ala Ala Thr Thr Pro 
145 150 155 160 

Val Leu Ala Asp Lys Lys Lys Tyr Pro Tyr Phe Phe Arg Thr Val Pro 

165 170 175 

Ser Asp Asn Ala Val Asn Pro Ala lie Leu Lys Leu Leu Lys His Tyr 

180 185 190 

Gin Trp Lys Arg Val Gly Thr Leu Thr Gin Asp Val Gin Arg Phe Ser 

195 200 205 

Glu Val Arg Asn Asp Leu Thr Gly Val Leu Tyr Gly Glu Asp He Glu 

210 215 220 

He Ser Asp Thr Glu Ser Phe Ser Asn Asp Pro Cys Thr Ser Val Lys 
225 230 235 240 

Lys Leu Lys Gly Asn Asp Val Arg He He Leu Gly Gin Phe Asp Gin 

245 250 255 

Asn Met Ala Ala Lys Val Phe Cys Cys Ala Tyr Glu Glu Asn Met Tyr 

260 2S5 270 

Gly Ser Lys Tyr Gin Trp He He Pro Gly Trp Tyr Glu Pro Ser Trp 

275 280 285 

Trp Glu Gin Val His Thr Glu Ala Asn Ser Ser Arg Cys Leu Arg Lys 

290 295 300 

Asn Leu Leu Ala Ala Met Glu Gly Tyr He Gly Val Asd Phe Glu Pro 
305 310 315 * 320 

Leu Ser Ser Lys Gin He Lys Thr He Ser Gly Lys Thr Pro Gin Gin 

325 330 335 

Tyr Glu Arg Glu Tyr Asn Asn Lys Arg Ser Gly Val Gly Pro Ser Lys 

340 345 350 

Phe His Gly Tyr Ala Tyr Asp Gly He Trp Val He Ala Lys Thr Leu 

355 360 365 

Gin Arg Ala Met Glu Thr Leu His Ala Ser Ser Arg His Gin Arg He 

370 375 380 

Gin Asp Phe Asn Tyr thr Asp His Thr Leu Gly Arg He He Leu Asn 
38S 390 395 400 

Ala Met Asn Glu Thr Asn Phe Phe Gly Val Thr Gly Gin Val Val Phe 

405 410 415 

Arg Asn Gly Glu Arg Met Gly Thr He Lys Phe Thr Gin Phe Gin Asp 
420 425 430 

Ser Arg Glu Val Lys Val Gly Glu Tyr Asn Ala Val Ala Asp Thr Leu 

435 440 445 

Glu He He Asn Asp Thr He Arg Phe Gin Gly Ser Glu Pro Pro Lys 

450 455 460 

Asp Lys Thr He He Leu Glu Gin Leu Arg Lys He Ser Leu Pro Leu 
465 470 475 480 

Tyr Ser He Leu Ser Ala Leu Thr He Leu Gly Met He Met Ala Ser 

485 490 495 

Ala Phe Leu Phe Phe Asn He Lys Asn Arg Asn Gin Lys Leu He Lys 

500 505 510 

Met Ser Ser Pro Tyr Met Asn Asn Leu He He Leu Gly Gly Met Leu 

515 520 525 

Ser Tyr Ala Ser He Phe Leu Phe Gly Leu Asp Gly Ser Phe Val Ser 

530 535 540 

Glu Lys Thr Phe Glu Thr Leu Cys Thr Val Arg Thr Trp He Leu Thr 
545 550 555 560 

Val Gly Tyr Thr Thr Ala Phe Gly Ala Met Phe Ala Lys Thr Trp Arg 

565 570 575 

Val His Ala He Phe Lys Asn Val Lys Met Lys Lys Lys He He Lys 

580 585 590 

Asp Gin Lys Leu Leu Val He Val Gly Gly Met Leu Leu He Asp Leu - 

595 600 605 

Cys He Leu He Cys Trp Gin Ala Val Asp Pro Leu Arg Arg Thr Val 

510 615 620 

Glu Lys Tyr Ser Met Glu Pro Asp Pro Ala Gly Arg Asp He Ser lie 
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625 630 635 640 

Arg Pro Leu Leu Glu His Cys Glu Asn Thr His Met Thr lie Trp Leu 

645 650 6 55 

Gly He Val Tyr Ala Tyr Lys Gly Leu Leu Met Leu Phe Gly Cys Phe 

660 665 670 

Leu Ala Trp Glu Thr Arg Asn Val Ser He Pro Ala Leu Asn Asp Ser 

675 680 685 

Lys Tyr He Gly Met Ser Val Tyr Asn Val Gly lie Met Cys lie He 

690 695 700 

Gly Ala Ala Val Ser Phe Leu Thr Arg Asp Gin Pro Asn Val Gin Phe 
70S 710 715 720 

Cys He Val Ala Leu Val He He Phe Cys Ser Thr He Thr Leu Cvs 

725 730 735 

Leu Val Phe Val Pro Lys Leu He Thr Leu Arg Thr Asn Pro Asp Ala 

740 745 750 

Ala Thr Gin Asn Arg Arg Phe Gin Phe Thr Gin Asn Gin Lys Lys Glu 

755 760 765 

Asp Ser Lys Thr Ser Thr Ser Val Thr Ser Val Asn Gin Ala Ser Thr 

770 775 780 

Ser Arg Leu Glu Gly Leu Gin Ser Glu Asn His Arg Leu Arg Met Lys 
785 790 795 800 

He Thr Glu Leu Asp Lys Asp Leu Glu Glu Val Thr Met Gin Leu Gin 

805 810 815 

Asp Thr Pro Glu Lys Thr Thr Tyr He Lys Gin Asn His Tyr Gin Glu 

820 825 830 

Leu Asn Asp He Leu Asn Leu Gly Asn Phe Thr Glu Ser Thr Asp Gly 

835 840 845 

Gly Lys Ala He Leu Lys Asn His Leu Asp Gin Asn Pro Gin Leu Gin 

850 855 860 

Trp Asn Thr Thr Glu Pro Ser Arg Thr Cys Lys Asp Pro He Glu Aso 
865 870 875 880 

He Asn Ser Pro Glu His He Gin Arg Arg Leu Ser Leu Gin Leu Pro 

885 890 895 

He Leu His His Arg His Val Pro Pro Ser Phe Arg Val Met Val Ser 
900 905 910 

Gly Leu 



<210> 30 

<211> 965 

<212> PRT 

<213> Homo sapiens 



<400> 30 



Met 


Ala 


Ser 


Pro 


Arg 


Ser 


Ser 


Gly Gin 


Pro Gly Pro 


Pro 


Pro 


Pro 


Pro 


1 








5 










10 








15 




Pro 


Pro 


Pro 


Pro 
20 


Ala 


Arg 


Leu 


Leu 


Leu 
25 


Leu Leu 


Leu 


Leu 


Pro 
30 


Leu 


Leu 


Leu 


Pro 


Leu 


Ala 


Pro 


Gly 


Ala 


Trp 


Gly Trp Ala 


Arg 


Gly Ala 


Pro 


Arg 






35 










40 








45 






Pro 


Pro 
50 


Pro 


Ser 


Ser 


Pro 


Pro 
55 


Leu 


Ser 


He Met 


Gly 
60 


Leu 


Met 


Pro 


Leu 


Thr 


Lys 


Glu 


Val 


Ala 


Lys 


Gly 


Ser 


He 


Gly Arg Gly Val 


Leu 


Pro 


Ala 


65 










70 








75 










80 


Val 


Glu 


Leu 


Ala 


He 
85 


Glu 


Gin 


He 


Arg 


Asn Glu 
90 


Ser 


Leu 


Leu 


Arg 
95 


Pro 


Tyr 


Phe 


Leu Asp 


Arg 


Leu 


Tyr Asp 


Thr 


Glu Cys 


Asp 


Asn 


Ala 


Lys 


Gly 








100 










105 








110 


Leu 


Lys 


Ala 
115 


Phe 


Tyr 


Asp 


Ala 


He 
120 


Lys 


Tyr Gly 


Pro 


Asn 
125 


His 


Leu 


Met 


Val 


Phe 


Gly Gly 


Val 


Cys 


Pro 


Ser 


Val 


Thr Ser 


He 


He 


Ala 


Glu 


Ser 




130 










135 








140 










Leu 


Gin 


Gly 


Trp 


Asn 


Leu 


Val 


Gin 


Leu 


Ser Phe 


Ala 


Ala 


Thr 


Thr 


Pro 


145 










150 








155 










160 


Val 


Leu 


Ala 


Asp 


Lys 
165 


Lys 


Lys 


Tyr 


Pro 


Tyr Phe 

170 


Phe 


Arg 


Thr 


Val 
175 


Pro 


Ser 


Asp 


Asn 


Ala 


Val 


Asn 


Pro 


Ala 


He 


Leu Lys 


Leu 


Leu 


Lys 


His 


Tyr 








180 










185 








190 




Gin 


Trp 


Ly3 


Arg 


Val 


Gly 


Thr 


Leu 


Thr Gin Asp 


val 


Gin 


Arc- 


Phe 


Ser 






195 










200 








205 








Glu 


Val 


Arg 


Asn 


Asp 


Leu 


Thr Gly 


Val 


Leu Tyr Gly Glu Asp 


Tie 


Glu 



210 215 220 



-13- 



WO 99/61606 PCT/US99/1 1869 



He Ser Asp Thr Clu Ser Phe Ser Asn Asp Pro Cys Thr Ser Val Lys 
225 230 235 240 

Lys Leu Lys Gly Asn Asp Val Arg lie He Leu Gly Gin Phe Asp Gin 

245 250 255 

Asn Met Ala Ala Lys Val Phe Cys Cys Ala Tyr Glu Glu Asn Met Tvr 

260 255 270 

Gly Ser Lys Tyr Gin Trp He He Pro Gly Trp Tyr Glu Pro Se- Tro 

275 260 285 " 

Trp Glu Gin Val His Thr Glu Ala Asn Ser Ser Arg Cys Leu Ara Lvs 

290 295 300 

Asn Leu Leu Ala Ala Met Glu Gly Tyr He Gly Val Asp Phe Glu Pro 
305 310 315 320 

Leu Ser Ser Lys Gin He Lys Thr. He Ser Gly Lys Thr Pro Gin Gin 

325 330 335 

Tyr Glu Arg Glu Tyr Asn Asn Lys Arg Ser Gly Val Gly Pro Ser Lys 

340 345 350 

Phe His Gly Tyr Ala Tyr Asp Gly He Trp Val He Ala Lys Thr Leu 

355 360 365 

Gin Arg Ala Met Glu Thr Leu His Ala Ser Ser Arg His Gin Arg He 

370 375 380 

Gin Asp Phe Asn Tyr Thr Asp His Thr Leu Gly Arg He He Leu Asn 
385 390 395 40 0 

Ala Met Asn Glu Thr Asn Phe Phe Gly Val Thr Gly Gin Val Val Phe 

405 410 
Arg Asn Gly Glu Arg Met Gly Thr He Lys Phe Thr Gin Phe Gin Asp 

420 425 430 

Ser Arg Glu Val Lys Val Gly Glu Tyr Asn Ala Val Ala Asp Thr Leu 

43S 440 445 

Glu He He Asn Asp Thr He Arg Phe Gin Gly Ser Glu Pro Pro Lys 

450 455 460 

Asp Lys Thr He He Leu Glu Gin Leu Arg Lys He Ser Leu Pro Leu 
465 470 475 480 

Tyr Ser He Leu Ser Ala Leu Thr He Leu Gly Met He Met Ala Ser 

485 490 495 

Ala Phe Leu Phe Phe Asn He Lys Asn Arg Asn Gin Lys Leu He Lys 

500 505 510 

Met Ser Ser Pro Tyr Met Asn Asn Leu He He Leu Gly Gly Met Leu 

51S 520 525 

Ser Tyr Ala Ser lie Phe Leu Phe Gly Leu Asp Gly Ser Phe Val Ser 

530 535 540 

Glu Lys Thr Phe Glu Thr Leu Cys Thr Val Arg Thr Trp He Leu Thr 
545 550 555 560 ■ 

Val Gly Tyr Thr Thr Ala Phe Gly Ala Met Phe Ala Lys Thr Trp Arg 

565 570 575 

Val His Ala He Phe Lys Asn Val Lys Met Lys Lys Lys He He Lys 

580 585 590 

Asp Gin Lys Leu Leu Val He Val Gly Gly Met Leu Leu He Asp Leu 

595 600 60S 

Cys He Leu He Cys Trp Gin Ala Val Asp Pro Leu Arg Arg Thr Val 

610 615 620 

Glu Lys Tyr Ser Met Glu Pro Asp Pro Ala Gly Arg Asp He Ser He 
625 630 635 640 

Arg Pro Leu Leu Glu His Cys Glu Asn Thr His Met Thr He Trp Leu 

645 650 655 

Gly He Val Tyr Ala Tyr Lys Gly Leu Leu Met Leu Phe Gly Cys Phe 

660 665 . 670 

Leu Ala Trp Glu Thr Arg Asn Val Ser He Pro Ala Leu Asn Aso Ser 

675 680 685 

Lys Tyr He Gly Met Ser Val Tyr Asn Val Gly He Met Cys He He 

690 695 700 

Gly Ala Ala Val Ser Phe Leu Thr Arg Asp Gin Pro Asn Val Gin Phe 
70S 710 715 720 

Cys He Val Ala Leu Val He He Phe Cys Ser Thr He Thr Leu Cys 

725 730 735 

Leu Val Phe Val Pro Lys Leu He Thr Leu Arg Thr Asn Pro Asp Ala 

740 745 750 

Ala Thr Gin Asn Arg Arg Phe Gin Phe Thr Gin Asn Gin Lys Lys Glu 

755 760 765 

Asp Ser Lys Thr Ser Thr Ser Val Thr Ser Val Asn Gin Al« Ser Thr 

770 775 78C 

Ser Arg Leu Glu Gly Leu Gin Ser Glu Asn His Arg Leu Arg Met Lys 
785 790 795 80 0 
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